NOVEL MOLECULES OF THE TANGO-77 RELATED PROTEIN
FAMILY AND USES THEREOF

Background of the Invention 
The polypeptide cytokine interleukin-1 (IL-1)
is a critical mediator of inflammatory and overall immune
response. To date, three members of the IL-1 family,
IL-1α, IL-1β and IL-1ra (interleukin-1 receptor
antagonist), have been isolated and cloned. IL-1α and
IL-1β are proinflammatory cytokines which elicit
biological responses, whereas IL-1ra is an antagonist of
IL-1α and IL-1β activity. Two distinct cell-surface
receptors have been identified for these ligands, the
type I IL-1 receptor (IL-1RtI) and type II IL-1 receptor
(IL-1RtII). Recent results suggest that the IL-1RtI is
the receptor responsible for transducing a signal and
producing biological effects.

As mentioned above, IL-1 is a key mediator of the
host inflammatory response. While inflammation is an
important homeostatic mechanism, aberrant inflammation
has the potential for inducing damage to the host.
Elevated IL-1 levels are known to be associated with a
number of diseases, particularly autoimmune diseases and
inflammatory disorders.

Since IL-1ra is a naturally occurring inhibitor of
IL-1, IL-1ra can be used to limit the aberrant and
potentially deleterious effects of IL-1. In experimental
animals, pretreatment with IL-1ra has been shown to
prevent death resulting from lipopolysaccharide-induced
sepsis. The relative absence of IL-1ra has also been
suggested to play a role in human inflammatory bowel
disease.



Summary of the Invention 
The present invention is based, at least in part,
on the discovery of a gene encoding Tango-77, a secreted
protein that is predicted to be a member of the cytokine
superfamily. The Tango-77 cDNA described below (SEQ ID
NO:1) has three possible open reading frames. The first
potential open reading frame encompasses 534 nucleotides
extending from nucleotide 356 to nucleotide 889 of SEQ ID
NO:1 (SEQ ID NO:3) and encodes a 178 amino acid protein
(SEQ ID NO:2). This protein may include a predicted
signal sequence of about 63 amino acids (from about amino
acid 1 to about amino acid 63 of SEQ ID NO:2 (SEQ ID
NO:4)) and a predicted mature protein of about 115 amino
acids (from about amino acid 64 to amino acid 178 of SEQ
ID NO:2 (SEQ ID NO:5)).

The second potential open reading frame
encompasses 498 nucleotides extending from nucleotide 389
to nucleotide 889 of SEQ ID NO:1 (SEQ ID NO:6) and
encodes a 167 amino acid protein (SEQ ID NO:7). This
protein may include a predicted signal sequence of about
52 amino acids (from about amino acid 1 to about amino
acid 52 of SEQ ID NO:7 (SEQ ID NO:8)) and a predicted
mature protein of about 115 amino acids (from about amino
acid 52 to amino acid 167 of SEQ ID NO:7 (SEQ ID NO:9)).

The third potential open reading frame encompasses
408 nucleotides extending from nucleotide 481 to
nucleotide 889 of SEQ ID NO:1 (SEQ ID NO:10) and encodes
a 136 amino acid protein (SEQ ID NO:11). This protein
includes a predicted signal sequence of about 21 amino
acids (from about amino acid 1 to about amino acid 21 of
SEQ ID NO:11 (SEQ ID NO:12)) and a predicted mature
protein of about 115 amino acids (from about amino acid
22 to amino acid 136 of SEQ ID NO:11 (SEQ ID NO:13)).
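By way of non-limiting illustration, the coordinates stated for the three potential open reading frames can be checked arithmetically. The following Python sketch is not part of the original disclosure; the names ORFS, MATURE_AA and check_orf are hypothetical, and the only inputs are the figures printed above (1-based, inclusive nucleotide positions on SEQ ID NO:1).

```python
# Figures copied from the text above; positions are 1-based and inclusive.
# All names here (ORFS, MATURE_AA, check_orf) are hypothetical helpers.
ORFS = {
    "first": {"start": 356, "end": 889, "stated_nt": 534,
              "stated_aa": 178, "signal_aa": 63},
    "second": {"start": 389, "end": 889, "stated_nt": 498,
               "stated_aa": 167, "signal_aa": 52},
    "third": {"start": 481, "end": 889, "stated_nt": 408,
              "stated_aa": 136, "signal_aa": 21},
}
MATURE_AA = 115  # predicted mature length stated for all three frames


def check_orf(orf):
    """Return simple arithmetic facts about one stated reading frame."""
    span_nt = orf["end"] - orf["start"] + 1  # inclusive coordinate span
    return {
        "span_nt": span_nt,
        "span_matches_stated_nt": span_nt == orf["stated_nt"],
        "stated_nt_is_whole_codons": orf["stated_nt"] % 3 == 0,
        "stated_nt_encodes_stated_aa":
            orf["stated_nt"] // 3 == orf["stated_aa"],
        "signal_plus_mature_equals_total":
            orf["signal_aa"] + MATURE_AA == orf["stated_aa"],
        # An in-frame downstream start differs from 356 by a multiple of 3.
        "same_frame_as_first": (orf["start"] - 356) % 3 == 0,
    }


if __name__ == "__main__":
    for name, orf in ORFS.items():
        print(name, check_orf(orf))
```

On the figures as printed, the first frame is self-consistent, whereas the stated coordinates of the second and third frames span 501 and 409 nucleotides rather than the stated 498 and 408. The sketch only reports this arithmetic and does not establish which printed figure, if any, is in error.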

As used herein, the terms "Tango-77", "Tango-77
protein", "Tango-77 polypeptide" and the like, can refer
to any polypeptide produced by the cDNA of SEQ ID NO:1
including any and all of the Tango-77 gene products
described above.

Tango-77 is expected to inhibit inflammation and
play a functional role similar to that of secreted
IL-1ra. For example, it is expected that Tango-77 may
bind to the IL-1 receptor, thus blocking receptor
activation by inhibiting the binding of IL-1α and IL-1β
to the receptor. Alternatively, Tango-77 may inhibit
inflammation through another pathway, for example, by
binding to a novel receptor. Accordingly, Tango-77 may
be useful as a modulating agent in regulating a variety
of cellular processes including acute and chronic
inflammation, e.g., asthma, chronic myelogenous leukemia,
rheumatoid arthritis, psoriasis and inflammatory bowel
disease.

In one aspect, the invention provides isolated
nucleic acid molecules encoding Tango-77 or biologically
active portions thereof, as well as nucleic acid
fragments suitable as primers or hybridization probes for
the detection of Tango-77.

The invention encompasses methods of diagnosing
and treating patients who are suffering from a disorder
associated with an abnormal level (undesirably high or
undesirably low) of inflammation, abnormal activity of
the IL-1 receptor complex, or abnormal activity of IL-1,
by administering a compound that modulates the expression
of Tango-77 (at the DNA, mRNA or protein level, e.g., by
altering mRNA splicing) or by altering the activity of
Tango-77. Examples of such compounds include small
molecules, antisense nucleic acid molecules, ribozymes,
and polypeptides.

The invention features a nucleic acid molecule
which is at least 45% (e.g., 55%, 65%, 75%, 85%, 95%, or
98%) identical to the nucleotide sequence shown in SEQ ID
NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ ID NO:10, the
nucleotide sequence of the cDNA insert of the plasmid
deposited with ATCC as Accession Number 98807 (the "cDNA
of ATCC 98807"), or a complement thereof.

The invention features a nucleic acid molecule
which includes a fragment of at least 100 (e.g., 250,
325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700,
800, 900, or 989) nucleotides of the nucleotide sequence
shown in SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ ID
NO:10, the nucleotide sequence of the cDNA of ATCC 98807,
or a complement thereof.

The invention also features a nucleic acid
molecule which includes a nucleotide sequence encoding a
protein having an amino acid sequence that is at least
45% (e.g., 55%, 65%, 75%, 85%, 95%, or 98%) identical to
the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ
ID NO:7, SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:13, or the
amino acid sequence encoded by the cDNA of ATCC 98807.
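For illustration only, a percent-identity threshold such as "at least 45%" can be evaluated once two sequences have been aligned. The Python sketch below assumes the sequences are already aligned to equal length with "-" marking gaps, and it divides by the full alignment length; other conventions for the denominator exist, and this section does not state which convention applies. The function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns in which both sequences carry the
    same residue. Inputs must be pre-aligned; '-' marks a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("empty alignment")
    # A column counts as a match only if the residues agree and are not gaps.
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


def meets_threshold(aligned_a: str, aligned_b: str,
                    threshold: float = 45.0) -> bool:
    """True if the two aligned sequences are at least `threshold` percent
    identical under the convention used by percent_identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

The same function applies to nucleotide or amino acid alignments, since it compares characters column by column.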

In a preferred embodiment, a Tango-77 nucleic acid
molecule has the nucleotide sequence shown in SEQ ID
NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ ID NO:10 or the
nucleotide sequence of the cDNA of ATCC 98807.

Also within the invention is a nucleic acid
molecule which encodes a fragment of a polypeptide having
the amino acid sequence of SEQ ID NO:2, SEQ ID NO:4, SEQ
ID NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID
NO:11, SEQ ID NO:12, or SEQ ID NO:13, wherein the fragment
includes at least 15 (e.g., 25, 30, 50, 100, 150, or 178)
contiguous amino acids of SEQ ID NO:2, SEQ ID NO:4, SEQ
ID NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID
NO:11, SEQ ID NO:12, SEQ ID NO:13, or the polypeptide
encoded by the cDNA of ATCC Accession Number 98807.

The invention includes a nucleic acid molecule
which encodes a naturally occurring allelic variant of a
polypeptide comprising the amino acid sequence of SEQ ID
NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8,
SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, or
an amino acid sequence encoded by the cDNA of ATCC
Accession Number 98807, wherein the nucleic acid molecule
hybridizes to a nucleic acid molecule comprising SEQ ID
NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ ID NO:10, or a
complement thereof under stringent conditions.



Also within the invention are: an isolated
Tango-77 protein having an amino acid sequence that is at
least about 45%, preferably 65%, 75%, 85%, 95%, or 98%
identical to the amino acid sequence of SEQ ID NO:5, SEQ
ID NO:9 or SEQ ID NO:13 (mature human Tango-77), or the
amino acid sequence of SEQ ID NO:2, SEQ ID NO:7 or SEQ ID
NO:11 (immature human Tango-77).

Also within the invention are: an isolated
Tango-77 protein which is encoded by a nucleic acid
molecule having a nucleotide sequence that is at least
about 65%, preferably 75%, 85%, or 95% identical to SEQ
ID NO:3, SEQ ID NO:6, SEQ ID NO:10 or the cDNA of ATCC
98807; and an isolated Tango-77 protein which is encoded
by a nucleic acid molecule having a nucleotide sequence
which hybridizes under stringent hybridization conditions
to a nucleic acid molecule having the nucleotide sequence
of SEQ ID NO:3, SEQ ID NO:6, SEQ ID NO:10, the non-coding
strand of the cDNA of ATCC 98807, or the complement
thereof.

Also within the invention is a polypeptide which
is a naturally occurring allelic variant of a polypeptide
that includes the amino acid sequence of SEQ ID NO:2, SEQ
ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID
NO:9, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, or an
amino acid sequence encoded by the cDNA insert of the
plasmid deposited with ATCC as Accession Number 98807,
wherein the polypeptide is encoded by a nucleic acid
molecule which hybridizes to a nucleic acid molecule
comprising SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ ID
NO:10 or the complement thereof under stringent
conditions.

Another embodiment of the invention features
Tango-77 nucleic acid molecules which specifically detect
Tango-77 nucleic acid molecules relative to nucleic acid
molecules encoding other members of the cytokine
superfamily. For example, in one embodiment, a Tango-77
nucleic acid molecule hybridizes under stringent
conditions to a nucleic acid molecule comprising the
nucleotide sequence of SEQ ID NO:1, SEQ ID NO:3, SEQ ID
NO:6, SEQ ID NO:10, the cDNA of ATCC 98807, or a
complement thereof. In another embodiment, the Tango-77
nucleic acid molecule is at least 300 (e.g., 325, 350,
375, 400, 425, 450, 500, 550, 600, 650, 700, 800, 900, or
989) nucleotides in length and hybridizes under stringent
conditions to a nucleic acid molecule comprising the
nucleotide sequence shown in SEQ ID NO:1, SEQ ID NO:3,
SEQ ID NO:6, SEQ ID NO:10, the cDNA of ATCC 98807, or a
complement thereof. In yet another embodiment, the
invention provides an isolated nucleic acid molecule
which is antisense to the coding strand of a Tango-77
nucleic acid.

Another aspect of the invention provides a vector,
e.g., a recombinant expression vector, comprising a
Tango-77 nucleic acid molecule of the invention. In
another embodiment, the invention provides a host cell
containing such a vector. The invention also provides a
method for producing Tango-77 protein by culturing, in a
suitable medium, a host cell of the invention containing
a recombinant expression vector such that a Tango-77
protein is produced.

Another aspect of this invention features isolated
or recombinant Tango-77 proteins and polypeptides.
Preferred Tango-77 proteins and polypeptides possess at
least one biological activity possessed by naturally
occurring human Tango-77, e.g., (i) the ability to
interact with proteins in the Tango-77 signalling pathway;
(ii) the ability to interact with a Tango-77 ligand or
receptor; (iii) the ability to interact with an
intracellular target protein; (iv) the ability to
interact with a protein involved in inflammation; and (v)
the ability to bind the IL-1 receptor. Other activities
include the induction and suppression of polypeptide
interleukins, cytokines and growth factors.

The Tango-77 proteins of the present invention, or
biologically active portions thereof, can be operably
linked to a non-Tango-77 polypeptide (e.g., heterologous
amino acid sequences) to form Tango-77 fusion proteins.
The invention further features antibodies that
specifically bind Tango-77 proteins, such as monoclonal
or polyclonal antibodies. In addition, the Tango-77
proteins or biologically active portions thereof can be
incorporated into pharmaceutical compositions, which
optionally include pharmaceutically acceptable carriers.

In another aspect, the present invention provides
a method for detecting the presence of Tango-77 activity
or expression in a biological sample by contacting the
biological sample with an agent capable of detecting an
indicator of Tango-77 activity or expression such that
the presence of Tango-77 activity or expression is
detected in the biological sample.



In another aspect, the invention provides a method
for modulating Tango-77 activity comprising contacting a
cell with an agent that modulates (inhibits or
stimulates) Tango-77 activity or expression such that
Tango-77 activity or expression in the cell is modulated.
In one embodiment, the agent is an antibody that
specifically binds to Tango-77 protein. In another
embodiment, the agent modulates expression of Tango-77 by
modulating transcription of a Tango-77 gene, splicing of a
Tango-77 mRNA, or translation of a Tango-77 mRNA. In yet
another embodiment, the agent is a nucleic acid molecule
having a nucleotide sequence that is antisense to the
coding strand of the Tango-77 mRNA or the Tango-77 gene.



In one embodiment, the methods of the present
invention are used to treat a subject having a disorder
characterized by aberrant Tango-77 protein activity or
nucleic acid expression by administering an agent which
is a Tango-77 modulator to the subject. In one
embodiment, the Tango-77 modulator is a Tango-77 protein.
In another embodiment, the Tango-77 modulator is a
Tango-77 nucleic acid molecule. In other embodiments,
the Tango-77 modulator is a peptide, peptidomimetic, or
other small molecule. In a preferred embodiment, the
disorder characterized by aberrant Tango-77 protein or
nucleic acid expression can include chronic and acute
inflammation.

The present invention also provides a diagnostic
assay for identifying the presence or absence of a
genetic lesion or mutation characterized by at least one
of: (i) aberrant modification or mutation of a gene
encoding a Tango-77 protein; (ii) mis-regulation of a
gene encoding a Tango-77 protein; and (iii) aberrant
post-translational modification of a Tango-77 protein,
wherein a wild-type form of the gene encodes a protein
with a Tango-77 activity.

In another aspect, the invention provides a
method for identifying a compound that binds to or
modulates the activity of a Tango-77 protein. In
general, such methods entail measuring a biological
activity of a Tango-77 protein in the presence and
absence of a test compound and identifying those
compounds which alter the activity of the Tango-77
protein.

The invention also features methods for
identifying a compound which modulates the expression of
Tango-77 by measuring the expression of Tango-77 in the
presence and absence of a compound.

Other features and advantages of the invention
will be apparent from the following detailed description
and claims.

Brief Description of the Drawings

Figure 1 depicts the cDNA sequence (SEQ ID NO:1)
of Tango-77. The Tango-77 cDNA has three possible open
reading frames which encode the amino acid sequences (SEQ
ID NO:2, SEQ ID NO:7 and SEQ ID NO:11) of human Tango-77.
The three potential open reading frames of SEQ ID NO:1
extend from: (1) nucleotide 356 to nucleotide 889 (SEQ ID
NO:3); (2) nucleotide 389 to nucleotide 889 (SEQ ID
NO:6); and (3) nucleotide 481 to nucleotide 889 (SEQ ID
NO:10).

Figure 2 depicts an alignment of an amino acid
sequence of Tango-77 (T77; SEQ ID NO:2) with IL-1ra (SEQ
ID NO:14), and IL-1β (SEQ ID NO:15).

Figure 3 depicts the genomic sequence of BAC1 (SEQ
ID NO:16).

Figure 4 depicts the genomic sequence of BAC2 (SEQ
ID NO:17).

Figure 5 depicts an amino acid sequence of an
alternatively spliced form of Tango-77 (SEQ ID NO:2) as
predicted by Procrustes (T77-procrustes; SEQ ID NO:18).

Figure 6 depicts an alignment of an amino acid
sequence of an alternatively spliced form of Tango-77
(T77-procrustes; SEQ ID NO:18) with Tango-77 (SEQ ID
NO:2).

Figure 7 depicts an alignment of an amino acid
sequence of an alternatively spliced form of Tango-77
(T77-procrustes; SEQ ID NO:18) with IL-1ra (SEQ ID
NO:14), and IL-1β (SEQ ID NO:15).

Detailed Description of the Invention

The present invention is based on the discovery of
a cDNA molecule encoding human Tango-77, a member of the
cytokine superfamily. The cDNA molecule encoding human
Tango-77 has three possible open reading frames. The
three possible nucleotide open reading frames for human
Tango-77 protein are shown in Figure 1 (SEQ ID NO:3, SEQ
ID NO:6 and SEQ ID NO:10). The predicted amino acid
sequences for the three possible Tango-77 immature
proteins are also shown in Figure 1 (SEQ ID NO:2, SEQ ID
NO:7 or SEQ ID NO:11), and three possible mature proteins
are also shown in Figure 1 (SEQ ID NO:5, SEQ ID NO:9 and
SEQ ID NO:13).

The Tango-77 cDNA of Figure 1 (SEQ ID NO:1), which
is approximately 989 nucleotides long including
untranslated regions, encodes a protein having
a molecular weight of approximately 19 kDa, 18 kDa, or
14.9 kDa (excluding post-translational modifications) and
the possible mature form of the protein has a molecular
weight of 13 kDa. A plasmid containing a cDNA encoding
human Tango-77 (with the cDNA insert name of Of fthx077)
was deposited with American Type Culture Collection
(ATCC), 10801 University Boulevard, Manassas, Virginia
20110-2209 on July 2, 1998 and assigned Accession Number
98807. This deposit will be maintained under the terms
of the Budapest Treaty on the International Recognition
of the Deposit of Microorganisms for the Purposes of
Patent Procedure. This deposit was made merely as a
convenience for those of skill in the art and is not an
admission that a deposit is required under 35 U.S.C.
§112.
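As a rough cross-check of the approximate molecular weights stated above, the common rule of thumb of about 110 Da per amino acid residue can be applied to the stated protein lengths (178, 167, 136 and 115 amino acids). The Python sketch below is illustrative only; the 110 Da average is a general approximation rather than a value taken from this disclosure, and the function name is hypothetical.

```python
# Rule-of-thumb average residue mass; an assumption, not a value from the text.
AVERAGE_RESIDUE_DA = 110.0


def estimate_kda(n_residues: int) -> float:
    """Approximate molecular weight in kDa from the residue count alone,
    ignoring composition and post-translational modifications."""
    if n_residues < 0:
        raise ValueError("residue count cannot be negative")
    return n_residues * AVERAGE_RESIDUE_DA / 1000.0


if __name__ == "__main__":
    # Lengths stated in the text for the three immature forms and the
    # predicted mature form.
    for label, length in [("178 aa", 178), ("167 aa", 167),
                          ("136 aa", 136), ("115 aa mature", 115)]:
        print(f"{label}: about {estimate_kda(length):.1f} kDa")
```

These estimates fall close to the approximate values of 19 kDa, 18 kDa, 14.9 kDa and 13 kDa given above; an exact figure would depend on the actual amino acid composition.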

Human Tango-77 is one member of a family of
molecules (the "Tango-77 family") having certain
conserved structural and functional features. The term
"family," when referring to the protein and nucleic acid
molecules of the invention, is intended to mean two or
more proteins or nucleic acid molecules having a common
structural domain and having sufficient amino acid or
nucleotide sequence identity as defined herein. Such
family members can be naturally occurring and can be from
either the same or different species. For example, a
family can contain a first protein of human origin and a
homologue of that protein of murine origin, as well as a
second, distinct protein of human origin and a murine
homologue of that protein. Members of a family may also
have common functional characteristics.

As used interchangeably herein, a "Tango-77
activity", "biological activity of Tango-77" or
"functional activity of Tango-77" refers to an activity
exerted by a Tango-77 protein, polypeptide or nucleic
acid molecule on a Tango-77 responsive cell as determined
in vivo, or in vitro, according to standard techniques.
A Tango-77 activity can be a direct activity, such as an
association with a second protein, or an indirect
activity, such as a cellular signaling activity mediated
by interaction of the Tango-77 protein with a second
protein. In a preferred embodiment, a Tango-77 activity
includes at least one or more of the following
activities: (i) the ability to interact with proteins in
the Tango-77 signalling pathway; (ii) the ability to
interact with a Tango-77 ligand or receptor; (iii) the
ability to interact with an intracellular target protein;
(iv) the ability to interact with a protein involved in
inflammation; and (v) the ability to bind the IL-1
receptor.

Accordingly, another embodiment of the invention
features isolated Tango-77 proteins and polypeptides
having a Tango-77 activity.

Yet another embodiment of the invention features
Tango-77 molecules which contain a signal sequence.
Generally, a signal sequence (or signal peptide) is a
peptide containing about 21 to 63 amino acids which
occurs at the extreme N-terminal end of a secretory
protein. The native Tango-77 signal sequence (SEQ ID
NO:4, SEQ ID NO:8, or SEQ ID NO:12) can be removed and
replaced with a signal sequence from another protein. In
certain host cells (e.g., mammalian host cells),
expression and/or secretion of Tango-77 can be increased
through use of a heterologous signal sequence. For
example, the gp67 secretory sequence of the baculovirus
envelope protein can be used as a heterologous signal
sequence. Alternatively, the native Tango-77 signal
sequence can itself be used as a heterologous signal
sequence in expression systems, e.g., to facilitate the
secretion of a protein of interest.

Various aspects of the invention are described in 
further detail in the following subsections. 

I. Isolated Nucleic Acid Molecules

One aspect of the invention pertains to isolated
nucleic acid molecules that encode Tango-77 proteins or
biologically active portions thereof, as well as nucleic
acid molecules sufficient for use as hybridization probes
to identify Tango-77-encoding nucleic acids (e.g.,
Tango-77 mRNA) and fragments for use as PCR primers for
the amplification or mutation of Tango-77 nucleic acid
molecules. As used herein, the term "nucleic acid
molecule" is intended to include DNA molecules (e.g.,
cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and
analogs of the DNA or RNA generated using nucleotide
analogs. The nucleic acid molecule can be
single-stranded or double-stranded, but preferably is
double-stranded DNA.



An "isolated" nucleic acid molecule is one which
is separated from other nucleic acid molecules which are
present in the natural source of the nucleic acid.
Preferably, an "isolated" nucleic acid is free of
sequences (preferably protein encoding sequences) which
naturally flank the nucleic acid (i.e., sequences located
at the 5' and 3' ends of the nucleic acid) in the genomic
DNA of the organism from which the nucleic acid is
derived. For example, in various embodiments, the
isolated Tango-77 nucleic acid molecule can contain less
than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb
of nucleotide sequences which naturally flank the nucleic
acid molecule in genomic DNA of the cell from which the
nucleic acid is derived. Moreover, an "isolated" nucleic
acid molecule, such as a cDNA molecule, can be
substantially free of other cellular material, or culture
medium when produced by recombinant techniques, or
substantially free of chemical precursors or other
chemicals when chemically synthesized.

A nucleic acid molecule of the present invention,
e.g., a nucleic acid molecule having the nucleotide
sequence of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ ID
NO:10, the cDNA of ATCC 98807, or a complement of any of
these nucleotide sequences, can be isolated using
standard molecular biology techniques and the sequence
information provided herein. Using all or a portion of
the nucleic acid sequences of SEQ ID NO:1, SEQ ID NO:3,
SEQ ID NO:6, SEQ ID NO:10, the cDNA of ATCC 98807, or the
complement thereof as a hybridization probe, Tango-77
nucleic acid molecules can be isolated using standard
hybridization and cloning techniques (e.g., as described
in Sambrook et al., eds., Molecular Cloning: A
Laboratory Manual, 2nd ed., Cold Spring Harbor
Laboratory, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, NY, 1989).

A nucleic acid of the invention can be amplified
using cDNA, mRNA or genomic DNA as a template and
appropriate oligonucleotide primers according to standard
PCR amplification techniques. The nucleic acid so
amplified can be cloned into an appropriate vector and
characterized by DNA sequence analysis. Furthermore,
oligonucleotides corresponding to Tango-77 nucleotide
sequences can be prepared by standard synthetic
techniques, e.g., using an automated DNA synthesizer.

In another preferred embodiment, an isolated
nucleic acid molecule of the invention comprises a
nucleic acid molecule which is a complement of the
nucleotide sequence shown in SEQ ID NO:1, SEQ ID NO:3,
SEQ ID NO:6, SEQ ID NO:10, the cDNA of ATCC 98807, or a
portion thereof. A nucleic acid molecule which is
complementary to a given nucleotide sequence is one which
is sufficiently complementary to the given nucleotide
sequence that it can hybridize to the given nucleotide
sequence thereby forming a stable duplex.
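To illustrate what a complement means at the sequence level, the following Python sketch computes the reverse complement of a DNA strand under Watson-Crick pairing (A with T, G with C). It shows only the strict case of perfect complementarity, whereas the text above requires merely that the molecule be sufficiently complementary to form a stable duplex. The function names are hypothetical and the input is assumed to contain only the letters A, C, G and T.

```python
# Translation table for Watson-Crick base pairing (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand read 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


def is_perfect_complement(strand_a: str, strand_b: str) -> bool:
    """True if strand_b is exactly the reverse complement of strand_a."""
    return reverse_complement(strand_a).upper() == strand_b.upper()
```

For example, reverse_complement("ATGC") returns "GCAT".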

Moreover, the nucleic acid molecule of the
invention can comprise only a portion of a nucleic acid
sequence encoding Tango-77, for example, a fragment which
can be used as a probe or primer or a fragment encoding a
biologically active portion of Tango-77. The nucleotide
sequence determined from the cloning of the human
Tango-77 gene allows for the generation of probes and
primers designed for use in identifying and/or cloning
Tango-77 homologues in other cell types, e.g., from other
tissues, as well as Tango-77 homologues from other
mammals. The probe/primer typically comprises
substantially purified oligonucleotide. The
oligonucleotide typically comprises a region of
nucleotide sequence that hybridizes under stringent
conditions to at least about 12, preferably about 25,
more preferably about 50, 75, 100, 125, 150, 175, 200,
250, 300, 350 or 400 consecutive nucleotides of the sense
or anti-sense sequence of SEQ ID NO:1, SEQ ID NO:3, SEQ
ID NO:6, SEQ ID NO:10, or the cDNA of ATCC 98807.
Alternatively, the oligonucleotide can typically comprise
a region of nucleotide sequence that hybridizes under
stringent conditions to at least about 12, preferably
about 25, more preferably about 50, 75, 100, 125, 150,
175, 200, 250, 300, 350 or 400 consecutive nucleotides of
the sense or anti-sense sequence of a naturally occurring
mutant of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ ID
NO:10, or the cDNA of ATCC 98807.
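As a non-limiting sketch of how stretches of consecutive nucleotides of a given length could be enumerated from a known sequence, the Python generator below yields every window of a chosen length together with its 1-based start position. The function name and the default length of 25 are illustrative assumptions; the sketch does not assess hybridization behavior or probe specificity.

```python
def candidate_probes(sequence: str, length: int = 25):
    """Yield (start, subsequence) for every run of `length` consecutive
    nucleotides in `sequence`; start positions are 1-based."""
    if length <= 0:
        raise ValueError("length must be positive")
    # A sequence of n nucleotides contains n - length + 1 such windows;
    # nothing is yielded when the sequence is shorter than `length`.
    for i in range(len(sequence) - length + 1):
        yield i + 1, sequence[i:i + length]
```

A sequence of 989 nucleotides, for instance, contains 978 windows of 12 consecutive nucleotides.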

Probes based on the human Tango-77 nucleotide
sequence can be used to detect transcripts or genomic
sequences encoding the same or identical proteins. The
probe comprises a label group attached thereto, e.g., a
radioisotope, a fluorescent compound, an enzyme, or an
enzyme co-factor. Such probes can be used as a part of a
diagnostic test kit for identifying cells or tissues
which mis-express a Tango-77 protein, such as by
measuring a level of a Tango-77-encoding nucleic acid in
a sample of cells from a subject, e.g., detecting
Tango-77 mRNA levels or determining whether a genomic
Tango-77 gene has been mutated or deleted.

A nucleic acid fragment encoding a "biologically
active portion of Tango-77" can be prepared by isolating
a portion of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ
ID NO:10 or the nucleotide sequence of the cDNA of ATCC
98807 which encodes a polypeptide having a Tango-77
biological activity, expressing the encoded portion of
Tango-77 protein (e.g., by recombinant expression in
vitro) and assessing the activity of the encoded portion
of Tango-77.

The invention further encompasses nucleic acid
molecules that differ from the nucleotide sequence of SEQ
ID NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ ID NO:10, or the
cDNA of ATCC 98807 due to degeneracy of the genetic code
and thus encode the same Tango-77 protein as that encoded
by the nucleotide sequence shown in SEQ ID NO:1, SEQ ID
NO:3, SEQ ID NO:6, SEQ ID NO:10, or the cDNA of ATCC
98807.
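The degeneracy referred to above can be illustrated with a small Python sketch in which two different nucleotide sequences translate to the same peptide. The codon table is a deliberately partial excerpt of the standard genetic code covering only the codons used here, and the two example sequences are invented for illustration; they are not taken from SEQ ID NO:1 or any other sequence of this disclosure.

```python
# Partial excerpt of the standard genetic code (DNA codons); it covers
# only the codons used in the invented example sequences below.
CODON_TABLE = {
    "ATG": "M",
    "CTG": "L", "CTC": "L", "TTA": "L",
    "AGC": "S", "TCT": "S",
    "GGC": "G", "GGA": "G",
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence codon by codon."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Two invented sequences that differ at the nucleotide level
# yet encode the same four-residue peptide.
seq_a = "ATGCTGAGCGGC"
seq_b = "ATGTTATCTGGA"
```

Here translate(seq_a) and translate(seq_b) both give "MLSG", even though the two nucleotide sequences are not identical.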

In addition to the human Tango-77 nucleotide
sequence shown in SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:6,
SEQ ID NO:10, or the cDNA of ATCC 98807, it will be
appreciated by those skilled in the art that DNA sequence
polymorphisms that lead to changes in the amino acid
sequences of Tango-77 may exist within a population
(e.g., the human population). Such genetic polymorphism
in the Tango-77 gene may exist among individuals within a
population due to natural allelic variation. An allele
is one of a group of genes which occur alternatively at a
given genetic locus. As used herein, the term "allelic
variant" refers to a nucleotide sequence which occurs at
a Tango-77 locus or to a polypeptide encoded by the
nucleotide sequence. As used herein, the terms "gene"
and "recombinant gene" refer to nucleic acid molecules
comprising an open reading frame encoding a Tango-77
protein, preferably a mammalian Tango-77 protein. Such
natural allelic variations can typically result in 1-5%
variance in the nucleotide sequence of the Tango-77 gene.
Alternative alleles can be identified by sequencing the
gene of interest in a number of different individuals.
This can be readily carried out by using hybridization
probes to identify the same genetic locus in a variety of
individuals. Any and all such nucleotide variations and
resulting amino acid polymorphisms or variations in



WO 99/06426 



PCT/US98/16102 




Tango-77 that are the result of natural allelic variation
and that do not alter the functional activity of Tango-77
are intended to be within the scope of the invention.

Moreover, nucleic acid molecules encoding Tango-77
proteins from other species (Tango-77 homologues), which
have a nucleotide sequence which differs from that of a
human Tango-77, are intended to be within the scope of
the invention. Nucleic acid molecules corresponding to
natural allelic variants and homologues of the Tango-77
cDNA of the invention can be isolated based on their
identity to the human Tango-77 nucleic acids disclosed
herein using the human cDNAs, or a portion thereof, as a
hybridization probe according to standard hybridization
techniques under stringent hybridization conditions.

Accordingly, in another embodiment, an isolated
nucleic acid molecule of the invention is at least 300
(e.g., 325, 350, 375, 400, 425, 450, 500, 550, 600, 650,
700, 800, or 989) nucleotides in length and hybridizes
under stringent conditions to the nucleic acid molecule
comprising the nucleotide sequence, preferably the coding
sequence, of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ
ID NO:10, or the cDNA of ATCC 98807.

As used herein, the term "hybridizes under
stringent conditions" is intended to describe conditions
for hybridization and washing under which nucleotide
sequences at least 60% (65%, 70%, preferably 75%)
identical to each other typically remain hybridized to
each other. Such stringent conditions are known to those
skilled in the art and can be found in Current Protocols
in Molecular Biology, John Wiley & Sons, N.Y. (1989),
6.3.1-6.3.6. A preferred, non-limiting example of
stringent hybridization conditions is hybridization in
6X sodium chloride/sodium citrate (SSC) at about 45°C,
followed by one or more washes in 0.2X SSC, 0.1% SDS at
50-65°C. Preferably, an isolated nucleic acid molecule
of the invention that hybridizes under stringent
conditions to the sequence of SEQ ID NO:1, SEQ ID NO:3,
SEQ ID NO:6, SEQ ID NO:10, the cDNA of ATCC 98807, or the
complement thereof, corresponds to a naturally-occurring
nucleic acid molecule. As used herein, a
"naturally-occurring" nucleic acid molecule refers to an
RNA or DNA molecule having a nucleotide sequence that
occurs in nature (e.g., encodes a natural protein).



In addition to naturally-occurring allelic
variants of the Tango-77 sequence that may exist in the
population, the skilled artisan will further appreciate
that changes can be introduced by mutation into the
nucleotide sequence of SEQ ID NO:1, SEQ ID NO:3, SEQ ID
NO:6, SEQ ID NO:10 or the cDNA of ATCC 98807, thereby
leading to changes in the amino acid sequence of the
encoded Tango-77 protein, without altering the biological
activity of the Tango-77 protein. Amino acid residues
that are not conserved or only semiconserved among
Tango-77 of various species may be non-essential for
activity and thus would likely be targets for alteration.
Alternatively, one can make nucleotide substitutions
leading to amino acid substitutions at "non-essential"
amino acid residues. A "non-essential" amino acid
residue is a residue that can be altered from the wild-
type sequence of Tango-77 (e.g., the sequence of SEQ ID
NO:2, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11
or SEQ ID NO:13) without altering the biological
activity, whereas an "essential" amino acid residue is
required for biological activity. For example, amino
acid residues that are conserved among the Tango-77
proteins of various species may be essential for activity
and thus would not likely be targets for alteration,
unless one wishes to reduce or alter Tango-77 activity.

Accordingly, another aspect of the invention
pertains to nucleic acid molecules encoding Tango-77
proteins that contain changes in amino acid residues that
are not essential for activity. Such Tango-77 proteins
differ in amino acid sequence from SEQ ID NO:2, SEQ ID
NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, or SEQ ID
NO:13 yet retain biological activity. In one embodiment,
the isolated nucleic acid molecule includes a nucleotide
sequence encoding a protein that includes an amino acid
sequence that is at least about 45%, 65%, 75%, 85%, 95%,
or 98% identical to the amino acid sequence of
SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ
ID NO:11, or SEQ ID NO:13.



An isolated nucleic acid molecule encoding a
Tango-77 protein having a sequence which differs from
that of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:7, SEQ ID
NO:9, SEQ ID NO:11, or SEQ ID NO:13 can be created by
introducing one or more nucleotide substitutions,
additions or deletions into the nucleotide sequence of
SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ ID NO:10, or
the cDNA of ATCC 98807 such that one or more amino acid
substitutions, additions or deletions are introduced into
the encoded protein. Mutations can be introduced by
standard techniques, such as site-directed mutagenesis
and PCR-mediated mutagenesis. Preferably, conservative
amino acid substitutions are made at one or more
predicted non-essential amino acid residues. A
"conservative amino acid substitution" is one in which
the amino acid residue is replaced with an amino acid
residue having a similar side chain. Families of amino
acid residues having similar side chains have been
defined in the art. These families include amino acids
with basic side chains (e.g., lysine, arginine,
histidine), acidic side chains (e.g., aspartic acid,
glutamic acid), uncharged polar side chains (e.g.,
glycine, asparagine, glutamine, serine, threonine,
tyrosine, cysteine), nonpolar side chains (e.g., alanine,
valine, leucine, isoleucine, proline, phenylalanine,
methionine, tryptophan), beta-branched side chains (e.g.,
threonine, valine, isoleucine) and aromatic side chains
(e.g., tyrosine, phenylalanine, tryptophan, histidine).
Thus, a predicted nonessential amino acid residue in
Tango-77 is preferably replaced with another amino acid
residue from the same side chain family. Alternatively,
mutations can be introduced randomly along all or part of
a Tango-77 coding sequence, such as by saturation
mutagenesis, and the resultant mutants can be screened
for Tango-77 biological activity to identify mutants that
retain activity. Following mutagenesis, the encoded
protein can be expressed recombinantly and the activity
of the protein can be determined.
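
The side chain families listed above can be written as a small
lookup, which may help when checking whether a proposed
substitution would count as conservative in the sense used here.
The following Python sketch is illustrative only: the dictionary
name, the function name and the use of one-letter residue codes
are assumptions made for this example and are not part of the
disclosure. Because the listed families overlap (e.g., threonine
is both uncharged polar and beta-branched), two residues are
treated as similar if they share at least one family.

```python
# Side chain families as listed in the text, using one-letter amino acid codes.
# Names below (SIDE_CHAIN_FAMILIES, is_conservative) are hypothetical.
SIDE_CHAIN_FAMILIES = {
    "basic": set("KRH"),             # lysine, arginine, histidine
    "acidic": set("DE"),             # aspartic acid, glutamic acid
    "uncharged_polar": set("GNQSTYC"),
    "nonpolar": set("AVLIPFMW"),
    "beta_branched": set("TVI"),
    "aromatic": set("YFWH"),
}


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if both residues belong to at least one common family."""
    original, replacement = original.upper(), replacement.upper()
    return any(
        original in family and replacement in family
        for family in SIDE_CHAIN_FAMILIES.values()
    )


if __name__ == "__main__":
    print(is_conservative("K", "R"))  # both basic
    print(is_conservative("D", "K"))  # acidic vs. basic
```

Such a check only reflects the family membership stated in the
text; whether a given substitution actually preserves activity
would still be determined by expressing and assaying the mutant.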

In a preferred embodiment, a mutant Tango-77
protein can be assayed for: (1) the ability to form
protein:protein interactions with proteins in the
Tango-77 signalling pathway; (2) the ability to bind a
Tango-77 ligand or receptor; (3) the ability to bind
to an intracellular target protein; (4) the ability to
interact with a protein involved in inflammation; or (5)
the ability to bind the IL-1 receptor. In yet another
preferred embodiment, a mutant Tango-77 can be assayed
for the ability to modulate inflammation, asthma,
autoimmune diseases, and sepsis.



The present invention encompasses antisense
nucleic acid molecules, i.e., molecules which are
complementary to a sense nucleic acid encoding a protein,
e.g., complementary to the coding strand of a double-
stranded cDNA molecule or complementary to an mRNA
sequence. Accordingly, an antisense nucleic acid can
hydrogen bond to a sense nucleic acid. The antisense
nucleic acid can be complementary to an entire Tango-77
coding strand, or to only a portion thereof, e.g., all or
part of the protein coding region (or open reading
frame). An antisense nucleic acid molecule can be
antisense to a noncoding region of the coding strand of a
nucleotide sequence encoding Tango-77. The noncoding
regions ("5' and 3' untranslated regions") are the 5' and
3' sequences which flank the coding region and are not
translated into amino acids.



Given the coding strand sequences encoding
Tango-77 disclosed herein (e.g., SEQ ID NO:3, SEQ ID NO:5,
or SEQ ID NO:8), antisense nucleic acids of the invention
can be designed according to the rules of Watson and
Crick base pairing. The antisense nucleic acid molecule
can be complementary to the entire coding region of
Tango-77 mRNA, but more preferably is an oligonucleotide
which is antisense to only a portion of the coding or
noncoding region of Tango-77 mRNA. For example, the
antisense oligonucleotide can be complementary to the
region surrounding the translation start site of Tango-77
mRNA, e.g., an oligonucleotide having the sequence
5'-TGCAACTTTTACAGGAAACAC-3' (SEQ ID NO:19) or
5'-CCTCACTTTTACCCGAGACTC-3' (SEQ ID NO:20) or
5'-GACGGGTGGTACTTAAAACAA-3' (SEQ ID NO:21). An antisense
oligonucleotide can be, for example, about 5, 10, 15, 20,
25, 30, 35, 40, 45 or 50 nucleotides in length. An
antisense nucleic acid of the invention can be
constructed using chemical synthesis and enzymatic
ligation reactions using procedures known in the art.
For example, an antisense nucleic acid (e.g., an
antisense oligonucleotide) can be chemically synthesized
using naturally occurring nucleotides or variously
modified nucleotides designed to increase the biological
stability of the molecules or to increase the physical
stability of the duplex formed between the antisense and
sense nucleic acids, e.g., phosphorothioate derivatives
and acridine substituted nucleotides can be used.
Examples of modified nucleotides which can be used to
generate the antisense nucleic acid include
5-fluorouracil, 5-bromouracil, 5-chlorouracil,
5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine,
5-(carboxyhydroxylmethyl) uracil,
5-carboxymethylaminomethyl-2-thiouridine,
5-carboxymethylaminomethyluracil, dihydrouracil,
beta-D-galactosylqueosine, inosine,
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine,
2,2-dimethylguanine, 2-methyladenine, 2-methylguanine,
3-methylcytosine, 5-methylcytosine, N6-adenine,
7-methylguanine, 5-methylaminomethyluracil,
5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine,
5'-methoxycarboxymethyluracil, 5-methoxyuracil,
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid
(v), wybutoxosine, pseudouracil, queosine,
2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil,
4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid
methylester, uracil-5-oxyacetic acid (v),
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil
(acp3)w, and 2,6-diaminopurine. Alternatively, the
antisense nucleic acid can be produced biologically using
an expression vector into which a nucleic acid has been
subcloned in an antisense orientation (i.e., RNA
transcribed from the inserted nucleic acid will be of an
antisense orientation to a target nucleic acid of
interest, described further in the following subsection).
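
Designing an antisense oligonucleotide by Watson and Crick base
pairing, as described above, amounts to taking the reverse
complement of the chosen stretch of the sense strand. The
following Python sketch illustrates this under stated
assumptions: the function name and the short example sequence
are hypothetical and are not taken from the sequence listing,
and the result is returned as unmodified DNA written 5' to 3'.

```python
# Complement table for DNA or RNA input; the output is always DNA.
# The names below are hypothetical and used for illustration only.
_COMPLEMENT = str.maketrans("ACGTUacgtu", "TGCAAtgcaa")


def antisense_dna(sense: str) -> str:
    """Return the reverse complement of a sense strand, written 5'->3'."""
    return sense.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Made-up sense fragment spanning a start codon (ATG); not from SEQ ID NO:1.
    toy_sense = "GCCACCATGGCTAGCAAG"
    print(antisense_dna(toy_sense))
```

In practice the window would be chosen from the actual Tango-77
sequence, and the oligonucleotide length and any modified
nucleotides would be selected as described in the text.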



The antisense nucleic acid molecules of the
invention are typically administered to a subject or
generated in situ such that they hybridize with or bind
to cellular mRNA and/or genomic DNA encoding a Tango-77
protein to thereby inhibit expression of the protein,
e.g., by inhibiting transcription and/or translation.
The hybridization can be by conventional nucleotide
complementarity to form a stable duplex, or, for example,
in the case of an antisense nucleic acid molecule which
binds to DNA duplexes, through specific interactions in
the major groove of the double helix. An example of a
route of administration of antisense nucleic acid
molecules of the invention includes direct injection at a
tissue site. Alternatively, antisense nucleic acid
molecules can be modified to target selected cells and
then administered systemically. For example, for
systemic administration, antisense molecules can be
modified such that they specifically bind to receptors or
antigens expressed on a selected cell surface, e.g., by
linking the antisense nucleic acid molecules to peptides
or antibodies which bind to cell surface receptors or
antigens. The antisense nucleic acid molecules can also
be delivered to cells using the vectors described herein.
To achieve sufficient intracellular concentrations of the
antisense molecules, vector constructs in which the
antisense nucleic acid molecule is placed under the
control of a strong pol II or pol III promoter are
preferred.

An antisense nucleic acid molecule of the
invention can be an α-anomeric nucleic acid molecule. An
α-anomeric nucleic acid molecule forms specific double-
stranded hybrids with complementary RNA in which,
contrary to the usual β-units, the strands run parallel
to each other (Gaultier et al. (1987) Nucleic Acids Res.
15:6625-6641). The antisense nucleic acid molecule can
also comprise a 2'-o-methylribonucleotide (Inoue et al.
(1987) Nucleic Acids Res. 15:6131-6148) or a chimeric
RNA-DNA analogue (Inoue et al. (1987) FEBS Lett. 215:327-
330).



The invention also encompasses ribozymes.
Ribozymes are catalytic RNA molecules with ribonuclease
activity which are capable of cleaving a single-stranded
nucleic acid, such as an mRNA, to which they have a
complementary region. Thus, ribozymes (e.g., hammerhead
ribozymes (described in Haseloff and Gerlach (1988)
Nature 334:585-591)) can be used to catalytically cleave
Tango-77 mRNA transcripts to thereby inhibit translation
of Tango-77 mRNA. A ribozyme having specificity for a
Tango-77-encoding nucleic acid can be designed based upon
the nucleotide sequence of a Tango-77 cDNA disclosed
herein (e.g., SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ
ID NO:10). For example, a derivative of a Tetrahymena L-
19 IVS RNA can be constructed in which the nucleotide
sequence of the active site is complementary to the
nucleotide sequence to be cleaved in a Tango-77-encoding
mRNA. See, e.g., Cech et al. U.S. Patent No. 4,987,071;
and Cech et al. U.S. Patent No. 5,116,742.
Alternatively, Tango-77 mRNA can be used to select a
catalytic RNA having a specific ribonuclease activity
from a pool of RNA molecules. See, e.g., Bartel and
Szostak (1993) Science 261:1411-1418.



The invention also encompasses nucleic acid
molecules which form triple helical structures. For
example, Tango-77 gene expression can be inhibited by
targeting nucleotide sequences complementary to the
regulatory region of the Tango-77 gene (e.g., the Tango-77
promoter and/or enhancers) to form triple helical
structures that prevent transcription of the Tango-77
gene in target cells. See generally, Helene (1991)
Anticancer Drug Des. 6(6):569-84; Helene (1992) Ann. N.Y.
Acad. Sci. 660:27-36; and Maher (1992) Bioessays
14(12):807-15.

In preferred embodiments, the nucleic acid
molecules of the invention can be modified at the base
moiety, sugar moiety or phosphate backbone to improve,
e.g., the stability, hybridization, or solubility of the
molecule. For example, the deoxyribose phosphate
backbone of the nucleic acids can be modified to generate
peptide nucleic acids (see Hyrup et al. (1996) Bioorganic
& Medicinal Chemistry 4(1):5-23). As used herein, the
terms "peptide nucleic acids" or "PNAs" refer to nucleic
acid mimics, e.g., DNA mimics, in which the deoxyribose
phosphate backbone is replaced by a pseudopeptide
backbone and only the four natural nucleobases are
retained. The neutral backbone of PNAs has been shown to
allow for specific hybridization to DNA and RNA under
conditions of low ionic strength. The synthesis of PNA
oligomers can be performed using standard solid phase
peptide synthesis protocols as described in Hyrup et al.
(1996) supra; Perry-O'Keefe et al. (1996) Proc. Natl.
Acad. Sci. USA 93:14670-675.

PNAs of Tango-77 can be used in therapeutic and
diagnostic applications. For example, PNAs can be used
as antisense or antigene agents for sequence-specific
modulation of gene expression by, e.g., inducing
transcription or translation arrest or inhibiting
replication. PNAs of Tango-77 can also be used, e.g., in
the analysis of single base pair mutations in a gene by,
e.g., PNA directed PCR clamping; as artificial
restriction enzymes when used in combination with other
enzymes, e.g., S1 nucleases (Hyrup (1996) supra); or as
probes or primers for DNA sequence and hybridization
(Hyrup (1996) supra; Perry-O'Keefe et al. (1996) Proc.
Natl. Acad. Sci. USA 93:14670-675).

In another embodiment, PNAs of Tango-77 can be
modified, e.g., to enhance their stability or cellular
uptake, by attaching lipophilic or other helper groups to
PNA, by the formation of PNA-DNA chimeras, or by the use
of liposomes or other techniques of drug delivery known
in the art. For example, PNA-DNA chimeras of Tango-77
can be generated which may combine the advantageous
properties of PNA and DNA. Such chimeras allow DNA
recognition enzymes, e.g., RNAse H and DNA polymerases,
to interact with the DNA portion while the PNA portion
would provide high binding affinity and specificity.
PNA-DNA chimeras can be linked using linkers of
appropriate lengths selected in terms of base stacking,
number of bonds between the nucleobases, and orientation
(Hyrup (1996) supra). The synthesis of PNA-DNA chimeras
can be performed as described in Hyrup (1996) supra and
Finn et al. (1996) Nucleic Acids Res. 24(17):3357-63.
For example, a DNA chain can be synthesized on a solid
support using standard phosphoramidite coupling chemistry
and modified nucleoside analogs. Compounds such as 5'-
(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite
can be used as a link between the PNA and the 5' end of
DNA (Mag et al. (1989) Nucleic Acid Res. 17:5973-88).
PNA monomers are then coupled in a stepwise manner to
produce a chimeric molecule with a 5' PNA segment and a
3' DNA segment (Finn et al. (1996) Nucleic Acids Res.
24(17):3357-63). Alternatively, chimeric molecules can
be synthesized with a 5' DNA segment and a 3' PNA segment
(Petersen et al. (1995) Bioorganic Med. Chem. Lett.
5:1119-1124).

In other embodiments, the oligonucleotide may
include other appended groups such as peptides (e.g., for
targeting host cell receptors in vivo), or agents
facilitating transport across the cell membrane (see,
e.g., Letsinger et al. (1989) Proc. Natl. Acad. Sci. USA
86:6553-6556; Lemaitre et al. (1987) Proc. Natl. Acad.
Sci. USA 84:648-652; PCT Publication No. WO 88/09810) or
the blood-brain barrier (see, e.g., PCT Publication No.
WO 89/10134). In addition, oligonucleotides can be
modified with hybridization-triggered cleavage agents
(see, e.g., Krol et al. (1988) Bio/Techniques 6:958-976)
or intercalating agents (see, e.g., Zon (1988) Pharm.
Res. 5:539-549). To this end, the oligonucleotide may be
conjugated to another molecule, e.g., a peptide,
hybridization-triggered cross-linking agent, transport
agent, hybridization-triggered cleavage agent, etc.

II. Isolated Tango-77 Proteins and Anti-Tango-77
Antibodies

One aspect of the invention pertains to isolated
Tango-77 proteins, and biologically active portions
thereof, as well as polypeptide fragments suitable for
use as immunogens to raise anti-Tango-77 antibodies. In
one embodiment, native Tango-77 proteins can be isolated
from cells or tissue sources by an appropriate
purification scheme using standard protein purification
techniques. In another embodiment, Tango-77 proteins are
produced by recombinant DNA techniques. As an alternative
to recombinant expression, a Tango-77 protein or polypeptide
can be synthesized chemically using standard peptide
synthesis techniques.

An "isolated" or "purified" protein or
biologically active portion thereof is substantially free
of cellular material or other contaminating proteins from
the cell or tissue source from which the Tango-77 protein
is derived, or substantially free of chemical precursors
or other chemicals when chemically synthesized. The
language "substantially free of cellular material"
includes preparations of Tango-77 protein in which the
protein is separated from cellular components of the
cells from which it is isolated or recombinantly
produced. Thus, Tango-77 protein that is substantially
free of cellular material includes preparations of
Tango-77 protein having less than about 30%, 20%, 10%, or
5% (by dry weight) of non-Tango-77 protein (also referred
to herein as a "contaminating protein"). When the
Tango-77 protein or biologically active portion thereof
is recombinantly produced, it is also preferably
substantially free of culture medium, i.e., culture
medium represents less than about 20%, 10%, or 5% of the
volume of the protein preparation. When Tango-77 protein
is produced by chemical synthesis, it is preferably
substantially free of chemical precursors or other
chemicals, i.e., it is separated from chemical precursors
or other chemicals which are involved in the synthesis of
the protein. Accordingly, such preparations of Tango-77
protein have less than about 30%, 20%, 10%, or 5% (by dry
weight) of chemical precursors or non-Tango-77 chemicals.

Biologically active portions of a Tango-77 protein
include peptides comprising amino acid sequences
sufficiently identical to or derived from the amino acid
sequence of the Tango-77 protein (e.g., the amino acid
sequence shown in SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:7,
SEQ ID NO:9, SEQ ID NO:11, or SEQ ID NO:13), which
include fewer amino acids than the full length Tango-77
proteins, and exhibit at least one activity of a Tango-77
protein. Typically, biologically active portions
comprise a domain or motif with at least one activity of
the Tango-77 protein. A biologically active portion of a
Tango-77 protein can be a polypeptide which is, for
example, 10, 25, 50, 100 or more amino acids in length.

Moreover, other biologically active portions, in
which other regions of the protein are deleted, can be
prepared by recombinant techniques and evaluated for one
or more of the functional activities of a native Tango-77
protein.

A preferred Tango-77 protein has the amino acid
sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:7,
SEQ ID NO:9, SEQ ID NO:11, or SEQ ID NO:13. Other useful
Tango-77 proteins are substantially identical to SEQ ID
NO:2, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID
NO:11, or SEQ ID NO:13 and retain the functional activity
of the protein of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:7,
SEQ ID NO:9, SEQ ID NO:11, or SEQ ID NO:13 yet differ in
amino acid sequence due to natural allelic variation or
mutagenesis. Accordingly, a useful Tango-77 protein is a
protein which includes an amino acid sequence at least
about 45%, preferably 55%, 65%, 75%, 85%, 95%, or 99%
identical to the amino acid sequence of SEQ ID NO:2, SEQ
ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, or SEQ
ID NO:13 and retains the functional activity of the
Tango-77 proteins of SEQ ID NO:2, SEQ ID NO:5, SEQ ID
NO:7, SEQ ID NO:9, SEQ ID NO:11, or SEQ ID NO:13. In a
preferred embodiment, the Tango-77 protein retains a
functional activity of the Tango-77 protein of SEQ ID
NO:2, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID
NO:11, or SEQ ID NO:13.

To determine the percent identity of two amino
acid sequences or of two nucleic acids, the sequences are
aligned for optimal comparison purposes (e.g., gaps can
be introduced in the sequence of a first amino acid or
nucleic acid sequence for optimal alignment with a second
amino or nucleic acid sequence). The amino acid residues
or nucleotides at corresponding amino acid positions or
nucleotide positions are then compared. When a position
in the first sequence is occupied by the same amino acid
residue or nucleotide as the corresponding position in
the second sequence, then the molecules are identical at
that position. The percent identity between the two
sequences is a function of the number of identical
positions shared by the sequences (i.e., % identity = # of
identical positions/total # of positions (e.g.,
overlapping positions) x 100). Preferably, the two
sequences are the same length.
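
The percent identity calculation described above can be sketched
in a few lines of Python. This is an illustrative sketch, not a
prescribed implementation: the function name, the use of "-" as
the gap character, and the choice to take every alignment column
(including columns in which one sequence has a gap) as the
"total # of positions" are assumptions made for this example.
The two input strings are assumed to be already aligned and
therefore of equal length.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity = identical positions / total positions x 100.

    Both inputs must already be aligned (equal length, gaps marked by `gap`).
    Columns where both sequences have a gap are ignored; only exact matches
    between residues are counted as identical.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    identical = 0
    total = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue  # no residue in either sequence at this column
        total += 1
        if a == b:
            identical += 1
    return 100.0 * identical / total if total else 0.0


if __name__ == "__main__":
    # Toy aligned sequences, invented for illustration.
    print(percent_identity("ACGT", "ACGA"))
    print(percent_identity("AC-T", "ACGT"))
```

Other denominators (for example, counting only ungapped
columns, or the length of the shorter sequence) are also in
common use and give different values for the same alignment.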

The determination of percent homology between two
sequences can be accomplished using a mathematical
algorithm. A preferred, non-limiting example of a
mathematical algorithm utilized for the comparison of two
sequences is the algorithm of Karlin and Altschul (1990)
Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in
Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA
90:5873-5877. Such an algorithm is incorporated into the
NBLAST and XBLAST programs of Altschul, et al. (1990)
J. Mol. Biol. 215:403-410. BLAST nucleotide searches can
be performed with the NBLAST program, score = 100,
wordlength = 12 to obtain nucleotide sequences homologous
to Tango-77 nucleic acid molecules of the invention.
BLAST protein searches can be performed with the XBLAST
program, score = 50, wordlength = 3 to obtain amino acid
sequences homologous to Tango-77 protein molecules of the
invention. To obtain gapped alignments for comparison
purposes, Gapped BLAST can be utilized as described in
Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402.
When utilizing BLAST and Gapped BLAST programs, the
default parameters of the respective programs (e.g.,
XBLAST and NBLAST) can be used. See
http://www.ncbi.nlm.nih.gov. Another preferred, non-
limiting example of a mathematical algorithm utilized for
the comparison of sequences is the algorithm of Myers and
Miller, CABIOS (1989). Such an algorithm is incorporated
into the ALIGN program (version 2.0) which is part of the
GCG sequence alignment software package. When utilizing
the ALIGN program for comparing amino acid sequences, a
PAM120 weight residue table, a gap length penalty of 12,
and a gap penalty of 4 can be used.

The percent identity between two sequences can be
determined using techniques similar to those described
above, with or without allowing gaps. In calculating
percent identity, only exact matches are counted.

The invention also provides Tango-77 chimeric or
fusion proteins. As used herein, a Tango-77 "chimeric
protein" or "fusion protein" comprises a Tango-77
polypeptide operably linked to a non-Tango-77
polypeptide. A "Tango-77 polypeptide" refers to a
polypeptide having an amino acid sequence corresponding
to Tango-77 polypeptides, whereas a "non-Tango-77
polypeptide" refers to a polypeptide having an amino acid
sequence corresponding to a protein which is not
substantially identical to the Tango-77 protein, e.g., a
protein which is different from the Tango-77 protein and
which is derived from the same or a different organism.
Within a Tango-77 fusion protein the Tango-77 polypeptide
can correspond to all or a portion of a Tango-77 protein,
preferably at least one biologically active portion of a
Tango-77 protein. Within the fusion protein, the term
"operably linked" is intended to indicate that the
Tango-77 polypeptide and the non-Tango-77 polypeptide are
fused in-frame to each other. The non-Tango-77
polypeptide can be fused to the N-terminus or C-terminus
of the Tango-77 polypeptide.

One useful fusion protein is a GST-Tango-77 fusion
protein in which the Tango-77 sequences are fused to the
C-terminus of the GST sequences. Such fusion proteins
can facilitate the purification of recombinant Tango-77.

In another embodiment, the fusion protein is a
Tango-77 protein containing a heterologous signal
sequence at its N-terminus. For example, the native
Tango-77 signal sequence (i.e., about amino acids 1 to 63
of SEQ ID NO:2; SEQ ID NO:4; or about amino acids 1 to 52
of SEQ ID NO:7; SEQ ID NO:8; or about amino acids 1 to 21
of SEQ ID NO:11; SEQ ID NO:12) can be removed and replaced
with a signal sequence from another protein. In certain
host cells (e.g., mammalian host cells), expression
and/or secretion of Tango-77 can be increased through use
of a heterologous signal sequence. For example, the gp67
secretory sequence of the baculovirus envelope protein
can be used as a heterologous signal sequence (Ausubel et
al., supra). Other examples of eukaryotic heterologous
signal sequences include the secretory sequences of
melittin and human placental alkaline phosphatase
(Stratagene; La Jolla, California). In yet another
example, useful prokaryotic heterologous signal sequences
include the phoA secretory signal (Sambrook et al.,
supra) and the protein A secretory signal (Pharmacia
Biotech; Piscataway, New Jersey).

In yet another embodiment, the fusion protein is
a Tango-77-immunoglobulin fusion protein in which all or
part of Tango-77 is fused to sequences derived from a
member of the immunoglobulin protein family. The
Tango-77-immunoglobulin fusion proteins of the invention
can be incorporated into pharmaceutical compositions and
administered to a subject to inhibit an interaction
between a Tango-77 ligand and a Tango-77 receptor on the
surface of a cell, to thereby suppress Tango-77-mediated
signal transduction in vivo. The Tango-77-immunoglobulin
fusion proteins can be used to affect the bioavailability
of a Tango-77 cognate ligand. Inhibition of the Tango-77
ligand/Tango-77 interaction may be useful therapeutically
for the treatment of both inflammatory and autoimmune
disorders. Moreover, the Tango-77-immunoglobulin fusion
proteins of the invention can be used as immunogens to
produce anti-Tango-77 antibodies in a subject, to purify
Tango-77 ligands and in screening assays to identify
molecules which inhibit the interaction of Tango-77 with
a Tango-77 receptor.

Preferably, a Tango-77 chimeric or fusion protein
of the invention is produced by standard recombinant DNA
techniques. For example, DNA fragments coding for the
different polypeptide sequences are ligated together in-
frame in accordance with conventional techniques, for
example by employing blunt-ended or stagger-ended termini
for ligation, restriction enzyme digestion to provide for
appropriate termini, filling-in of cohesive ends as
appropriate, alkaline phosphatase treatment to avoid
undesirable joining, and enzymatic ligation. In another
embodiment, the fusion gene can be synthesized by
conventional techniques including automated DNA
synthesizers. Alternatively, PCR amplification of gene
fragments can be carried out using anchor primers which
give rise to complementary overhangs between two
consecutive gene fragments which can subsequently be
annealed and reamplified to generate a chimeric gene
sequence (see, e.g., Current Protocols in Molecular
Biology, Ausubel et al. eds., John Wiley & Sons: 1992).
Moreover, many expression vectors are commercially
available that already encode a fusion moiety (e.g., a
GST polypeptide). A Tango-77-encoding nucleic acid can
be cloned into such an expression vector such that the
fusion moiety is linked in-frame to the Tango-77 protein.
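
The requirement that the two coding sequences be joined
"in-frame" can be checked at the sequence level before any
cloning is attempted. The Python sketch below is a simplified
illustration under stated assumptions: the function name and
the toy sequences are hypothetical, each input is assumed to be
a complete coding sequence starting at the first base of a
codon, and linkers, restriction sites and vector sequences are
ignored.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(n_terminal_cds: str, c_terminal_cds: str) -> str:
    """Join two coding sequences so the second is read in the same frame.

    A terminal stop codon on the N-terminal partner is removed so that
    translation continues into the C-terminal partner.
    """
    n_cds = n_terminal_cds.upper()
    c_cds = c_terminal_cds.upper()
    for label, cds in (("N-terminal", n_cds), ("C-terminal", c_cds)):
        if len(cds) % 3 != 0:
            raise ValueError(f"{label} sequence length is not a multiple of 3")
    if n_cds[-3:] in STOP_CODONS:
        n_cds = n_cds[:-3]
    return n_cds + c_cds


if __name__ == "__main__":
    # Made-up fragments standing in for a fusion moiety and a Tango-77 insert.
    print(fuse_in_frame("ATGGCCTAA", "GGCAAATGA"))
```

A check of this kind only confirms reading-frame arithmetic; it
does not replace sequencing of the finished construct.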

The present invention also pertains to variants of
the Tango-77 proteins (i.e., proteins having a sequence
which differs from that of the Tango-77 amino acid
sequence). Such variants can function as either Tango-77
agonists (mimetics) or as Tango-77 antagonists. Variants
of the Tango-77 protein can be generated by mutagenesis,
e.g., discrete point mutation or truncation of the
Tango-77 protein. An agonist of the Tango-77 protein can
retain substantially the same, or a subset, of the
biological activities of the naturally occurring form of
the Tango-77 protein. An antagonist of the Tango-77
protein can inhibit one or more of the activities of the
naturally occurring form of the Tango-77 protein by, for
example, competitively binding to a downstream or
upstream member of a cellular signaling cascade which
includes the Tango-77 protein. Thus, specific biological
effects can be elicited by treatment with a variant of
limited function. Treatment of a subject with a variant
having a subset of the biological activities of the
naturally occurring form of the protein can have fewer
side effects in a subject relative to treatment with the
naturally occurring form of the Tango-77 proteins.

Variants of the Tango-77 protein which function as
either Tango-77 agonists (mimetics) or as Tango-77
antagonists can be identified by screening combinatorial
libraries of mutants, e.g., truncation mutants, of the
Tango-77 protein for Tango-77 protein agonist or
antagonist activity. In one embodiment, a variegated
library of Tango-77 variants is generated by
combinatorial mutagenesis at the nucleic acid level and
is encoded by a variegated gene library. A variegated
library of Tango-77 variants can be produced by, for
example, enzymatically ligating a mixture of synthetic
oligonucleotides into gene sequences such that a
degenerate set of potential Tango-77 sequences is
expressible as individual polypeptides, or alternatively,
as a set of larger fusion proteins (e.g., for phage
display) containing the set of Tango-77 sequences
therein. There are a variety of methods which can be
used to produce libraries of potential Tango-77 variants
from a degenerate oligonucleotide sequence. Chemical
synthesis of a degenerate gene sequence can be performed
in an automatic DNA synthesizer, and the synthetic gene
then ligated into an appropriate expression vector. Use
of a degenerate set of genes allows for the provision, in
one mixture, of all of the sequences encoding the desired
set of potential Tango-77 sequences. Methods for
synthesizing degenerate oligonucleotides are known in the
art (see, e.g., Narang (1983) Tetrahedron 39:3; Itakura
et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al.
(1984) Science 198:1056; Ike et al. (1983) Nucleic Acid
Res. 11:477).
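
To make concrete how a single degenerate oligonucleotide stands for a whole
set of sequences, the following sketch expands the standard IUPAC nucleotide
ambiguity codes. The function names and the example oligonucleotides are
illustrative assumptions and are not taken from the Tango-77 sequence.

```python
from itertools import product
from math import prod

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

def library_size(oligo: str) -> int:
    """Number of distinct sequences encoded by a degenerate oligonucleotide."""
    return prod(len(IUPAC[base]) for base in oligo.upper())

def expand(oligo: str) -> list[str]:
    """List every concrete sequence encoded by a degenerate oligonucleotide."""
    choices = (IUPAC[base] for base in oligo.upper())
    return ["".join(combo) for combo in product(*choices)]

# Hypothetical example: one fixed codon followed by one degenerate NNK codon.
members = expand("ATGNNK")
```

Checking library_size before calling expand is advisable, because the number
of members grows multiplicatively with every degenerate position.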

In addition, libraries of fragments of the
Tango-77 protein coding sequence can be used to generate
a variegated population of Tango-77 fragments for
screening and subsequent selection of variants of a
Tango-77 protein. In one embodiment, a library of coding
sequence fragments can be generated by treating a double
stranded PCR fragment of a Tango-77 coding sequence with
a nuclease under conditions wherein nicking occurs only
about once per molecule, denaturing the double stranded
DNA, renaturing the DNA to form double stranded DNA which
can include sense/antisense pairs from different nicked
products, removing single stranded portions from reformed
duplexes by treatment with S1 nuclease, and ligating the
resulting fragment library into an expression vector. By
this method, an expression library can be derived which
encodes N-terminal and internal fragments of various
sizes of the Tango-77 protein.
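
The nick, denature, renature and trim sequence can be pictured with a
deliberately simplified model. It assumes exactly one nick per strand, pairs
strand pieces from two differently nicked molecules, and treats S1 nuclease
trimming as keeping only the region where both pieces overlap. The function
name and coordinates are hypothetical, and the model ignores annealing
efficiency and sequence effects.

```python
def duplex_fragments(length: int, sense_nick: int, antisense_nick: int):
    """Return (start, end) intervals, in sense-strand coordinates, that stay
    double stranded when pieces of a sense strand nicked at `sense_nick`
    reanneal with pieces of an antisense strand nicked at `antisense_nick`
    and single stranded overhangs are trimmed away.
    """
    sense_pieces = [(0, sense_nick), (sense_nick, length)]
    antisense_pieces = [(0, antisense_nick), (antisense_nick, length)]
    fragments = set()
    for s_start, s_end in sense_pieces:
        for a_start, a_end in antisense_pieces:
            # Only the overlap of the two pieces is base paired.
            start, end = max(s_start, a_start), min(s_end, a_end)
            if end > start:
                fragments.add((start, end))
    return sorted(fragments)

# Hypothetical 100-nucleotide coding fragment nicked at positions 30 and 70.
pieces = duplex_fragments(100, 30, 70)
```

Under these assumptions the example yields a fragment that begins at the
first nucleotide, an internal fragment, and a fragment running to the last
nucleotide; varying the nick positions across a population gives the range
of fragment sizes referred to above.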

Several techniques are known in the art for
screening gene products of combinatorial libraries made
by point mutations or truncation, and for screening cDNA
libraries for gene products having a selected property.
Such techniques are adaptable for rapid screening of the
gene libraries generated by the combinatorial mutagenesis
of Tango-77 proteins. The most widely used techniques,
which are amenable to high throughput analysis, for
screening large gene libraries typically include cloning
the gene library into replicable expression vectors,
transforming appropriate cells with the resulting library
of vectors, and expressing the combinatorial genes under
conditions in which detection of a desired activity
facilitates isolation of the vector encoding the gene
whose product was detected. Recursive ensemble
mutagenesis (REM), a technique which enhances the
frequency of functional mutants in the libraries, can be
used in combination with the screening assays to identify
Tango-77 variants (Arkin and Youvan (1992) Proc. Natl.
Acad. Sci. USA 89:7811-7815; Delagrave et al. (1993)
Protein Engineering 6(3):327-331).

An isolated Tango-77 protein, or a portion or
fragment thereof, can be used as an immunogen to generate
antibodies that bind Tango-77 using standard techniques
for polyclonal and monoclonal antibody preparation. The
full-length Tango-77 protein can be used or,
alternatively, the invention provides antigenic peptide
fragments of Tango-77 for use as immunogens. The
antigenic peptide of Tango-77 comprises at least 8
(preferably 10, 15, 20, or 30) amino acid residues of the
amino acid sequence shown in SEQ ID NO:2, SEQ ID NO:5,
SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11 or SEQ ID NO:13
and encompasses an epitope of Tango-77 such that an
antibody raised against the peptide forms a specific
immune complex with Tango-77.
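
As a rough illustration of the length criteria above, the following sketch
enumerates every contiguous peptide of a given set of lengths from a protein
sequence. It only lists candidates; it does not predict which of them
contain an epitope. The function name and the ten-residue example sequence
are hypothetical and are not taken from any of the SEQ ID NOs.

```python
def candidate_peptides(protein: str, lengths=(8, 10, 15, 20, 30)):
    """Yield (start, length, peptide) for each contiguous window.

    `start` is 1-based, following the usual residue numbering convention.
    """
    for n in lengths:
        for i in range(len(protein) - n + 1):
            yield (i + 1, n, protein[i:i + n])

# Hypothetical placeholder sequence, not a Tango-77 sequence.
windows = list(candidate_peptides("MKTAYIAKQR"))
```

For a ten-residue input only the 8-residue and 10-residue windows exist, so
the longer lengths simply contribute nothing.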

A Tango-77 immunogen typically is used to prepare
antibodies by immunizing a suitable subject (e.g.,
rabbit, goat, mouse or other mammal) with the immunogen.
An appropriate immunogenic preparation can contain, for
example, recombinantly expressed Tango-77 protein or a
chemically synthesized Tango-77 polypeptide. The
preparation can further include an adjuvant, such as
Freund's complete or incomplete adjuvant, or similar
immunostimulatory agent. Immunization of a suitable
subject with an immunogenic Tango-77 preparation induces
a polyclonal anti-Tango-77 antibody response.

Accordingly, another aspect of the invention
pertains to anti-Tango-77 antibodies. The term
"antibody" as used herein refers to immunoglobulin
molecules and immunologically active portions of
immunoglobulin molecules, i.e., molecules that contain an
antigen binding site which specifically binds an antigen,
such as Tango-77. A molecule which specifically binds to
Tango-77 is a molecule which binds Tango-77, but does not
substantially bind other molecules in a sample, e.g., a
biological sample, which naturally contains Tango-77.
Examples of immunologically active portions of
immunoglobulin molecules include F(ab) and F(ab')2
fragments which can be generated by treating the antibody
with an enzyme such as pepsin. The invention provides
polyclonal and monoclonal antibodies that bind Tango-77.
The term "monoclonal antibody" or "monoclonal antibody
composition", as used herein, refers to a population of
antibody molecules that contain only one species of an
antigen binding site capable of immunoreacting with a
particular epitope of Tango-77. A monoclonal antibody
composition thus typically displays a single binding
affinity for a particular Tango-77 protein with which it
immunoreacts.



Polyclonal anti-Tango-77 antibodies can be
prepared as described above by immunizing a suitable
subject with a Tango-77 immunogen. The anti-Tango-77
antibody titer in the immunized subject can be monitored
over time by standard techniques, such as with an enzyme
linked immunosorbent assay (ELISA) using immobilized
Tango-77. If desired, the antibody molecules directed
against Tango-77 can be isolated from the mammal (e.g.,
from the blood) and further purified by well-known
techniques, such as protein A chromatography to obtain
the IgG fraction. At an appropriate time after
immunization, e.g., when the anti-Tango-77 antibody
titers are highest, antibody-producing cells can be
obtained from the subject and used to prepare monoclonal
antibodies by standard techniques, such as the hybridoma
technique originally described by Kohler and Milstein
(1975) Nature 256:495-497, the human B cell hybridoma
technique (Kozbor et al. (1983) Immunol Today 4:72), the
EBV-hybridoma technique (Cole et al. (1985), Monoclonal
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp.
77-96) or trioma techniques. The technology for
producing hybridomas is well known (see generally Current
Protocols in Immunology (1994) Coligan et al. (eds.) John
Wiley & Sons, Inc., New York, NY). Briefly, an immortal
cell line (typically a myeloma) is fused to lymphocytes
(typically splenocytes) from a mammal immunized with a
Tango-77 immunogen as described above, and the culture
supernatants of the resulting hybridoma cells are
screened to identify a hybridoma producing a monoclonal
antibody that binds Tango-77.

Any of the many well known protocols used for
fusing lymphocytes and immortalized cell lines can be
applied for the purpose of generating an anti-Tango-77
monoclonal antibody (see, e.g., Current Protocols in
Immunology, supra; Galfre et al. (1977) Nature 266:550-52;
R.H. Kennett, in Monoclonal Antibodies: A New Dimension
In Biological Analyses, Plenum Publishing Corp., New
York, New York (1980); and Lerner (1981) Yale J. Biol.
Med., 54:387-402). Moreover, the ordinarily skilled
worker will appreciate that there are many variations of
such methods which also would be useful. Typically, the
immortal cell line (e.g., a myeloma cell line) is derived
from the same mammalian species as the lymphocytes. For
example, murine hybridomas can be made by fusing
lymphocytes from a mouse immunized with an immunogenic
preparation of the present invention with an immortalized
mouse cell line, e.g., a myeloma cell line that is
sensitive to culture medium containing hypoxanthine,
aminopterin and thymidine ("HAT medium"). Any of a
number of myeloma cell lines can be used as a fusion
partner according to standard techniques, e.g., the
P3-NS1/1-Ag4-1, P3-x63-Ag8.653 or Sp2/O-Ag14 myeloma lines.
These myeloma lines are available from ATCC. Typically,
HAT-sensitive mouse myeloma cells are fused to mouse
splenocytes using polyethylene glycol ("PEG"). Hybridoma
cells resulting from the fusion are then selected using
HAT medium, which kills unfused and unproductively fused
myeloma cells (unfused splenocytes die after several days
because they are not transformed). Hybridoma cells
producing a monoclonal antibody of the invention are
detected by screening the hybridoma culture supernatants
for antibodies that bind Tango-77, e.g., using a standard
ELISA assay.

As an alternative to preparing monoclonal antibody-
secreting hybridomas, a monoclonal anti-Tango-77 antibody
can be identified and isolated by screening a recombinant
combinatorial immunoglobulin library (e.g., an antibody
phage display library) with Tango-77 to thereby isolate
immunoglobulin library members that bind Tango-77. Kits
for generating and screening phage display libraries are
commercially available (e.g., the Pharmacia Recombinant
Phage Antibody System, Catalog No. 27-9400-01; and the
Stratagene SurfZAP™ Phage Display Kit, Catalog No.
240612). Additionally, examples of methods and reagents
particularly amenable for use in generating and screening
an antibody display library can be found in, for example,
U.S. Patent No. 5,223,409; PCT Publication No. WO
92/18619; PCT Publication No. WO 91/17271; PCT
Publication No. WO 92/20791; PCT Publication No. WO
92/15679; PCT Publication No. WO 93/01288; PCT
Publication No. WO 92/01047; PCT Publication No. WO
92/09690; PCT Publication No. WO 90/02809; Fuchs et al.
(1991) Bio/Technology 9:1370-1372; Hay et al. (1992) Hum.
Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science
246:1275-1281; Griffiths et al. (1993) EMBO J 12:725-734.

Additionally, recombinant anti-Tango-77
antibodies, such as chimeric and humanized monoclonal
antibodies, comprising both human and non-human portions,
which can be made using standard recombinant DNA
techniques, are within the scope of the invention. Such
chimeric and humanized monoclonal antibodies can be
produced by recombinant DNA techniques known in the art,
for example using methods described in PCT Publication
No. WO 87/02671; European Patent Application 184,187;
European Patent Application 171,496; European Patent
Application 173,494; PCT Publication No. WO 86/01533;
U.S. Patent No. 4,816,567; European Patent Application
125,023; Better et al. (1988) Science 240:1041-1043; Liu
et al. (1987) Proc. Natl. Acad. Sci. USA 84:3439-3443;
Liu et al. (1987) J. Immunol. 139:3521-3526; Sun et al.
(1987) Proc. Natl. Acad. Sci. USA 84:214-218; Nishimura
et al. (1987) Canc. Res. 47:999-1005; Wood et al. (1985)
Nature 314:446-449; Shaw et al. (1988) J. Natl.
Cancer Inst. 80:1553-1559; Morrison (1985) Science
229:1202-1207; Oi et al. (1986) Bio/Techniques 4:214;
U.S. Patent 5,225,539; Jones et al. (1986) Nature
321:522-525; Verhoeyen et al. (1988) Science 239:1534;
and Beidler et al. (1988) J. Immunol. 141:4053-4060.

Completely human antibodies are particularly
desirable for therapeutic treatment of human patients.
Such antibodies can be produced using transgenic mice
which are incapable of expressing endogenous
immunoglobulin heavy and light chain genes, but which
can express human heavy and light chain genes. The
transgenic mice are immunized in the normal fashion with
a selected antigen, e.g., all or a portion of Tango-77.
Monoclonal antibodies directed against the antigen can be
obtained using conventional hybridoma technology. The
human immunoglobulin transgenes harbored by the
transgenic mice rearrange during B cell differentiation,
and subsequently undergo class switching and somatic
mutation. Thus, using such a technique, it is possible
to produce therapeutically useful IgG, IgA and IgE
antibodies. For an overview of this technology for
producing human antibodies, see Lonberg and Huszar (1995,
Int. Rev. Immunol. 13:65-93). For a detailed discussion
of this technology for producing human antibodies and
human monoclonal antibodies and protocols for producing
such antibodies, see, e.g., U.S. Patent 5,625,126; U.S.
Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent
5,661,016; and U.S. Patent 5,545,806. In addition,
companies such as Abgenix, Inc. (Fremont, CA), can be
engaged to provide human antibodies directed against a
selected antigen using technology similar to that
described above.



Completely human antibodies which recognize a
selected epitope can be generated using a technique
referred to as "guided selection." In this approach a
selected non-human monoclonal antibody, e.g., a murine
antibody, is used to guide the selection of a completely
human antibody recognizing the same epitope.

First, a non-human monoclonal antibody which binds
a selected antigen (epitope), e.g., an antibody which
inhibits Tango-77 activity, is identified. The heavy
chain and the light chain of the non-human antibody are
cloned and used to create phage display Fab fragments.
For example, the heavy chain gene can be cloned into a
plasmid vector so that the heavy chain can be secreted
from bacteria. The light chain gene can be cloned into a
phage coat protein gene so that the light chain can be
expressed on the surface of phage. A repertoire (random
collection) of human light chains fused to phage is used
to infect the bacteria which express the non-human heavy
chain. The resulting progeny phage display hybrid
antibodies (human light chain/non-human heavy chain).
The selected antigen is used in a panning screen to
select phage which bind the selected antigen. Several
rounds of selection may be required to identify such
phage. Next, human light chain genes are isolated from
the selected phage which bind the selected antigen.
These selected human light chain genes are then used to
guide the selection of human heavy chain genes as
follows. The selected human light chain genes are
inserted into vectors for expression by bacteria.
Bacteria expressing the selected human light chains are
infected with a repertoire of human heavy chains fused to
phage. The resulting progeny phage display human
antibodies (human light chain/human heavy chain).

Next, the selected antigen is used in a panning
screen to select phage which bind the selected antigen.
The phage selected in this step display completely human
antibodies which recognize the same epitope recognized by
the original selected, non-human monoclonal antibody.
The genes encoding both the heavy and light chains are
readily isolated and can be further manipulated for
production of human antibody. This technology is
described by Jespers et al. (1994, Bio/technology
12:899-903).

An anti-Tango-77 antibody (e.g., monoclonal
antibody) can be used to isolate Tango-77 by standard
techniques, such as affinity chromatography or
immunoprecipitation. An anti-Tango-77 antibody can
facilitate the purification of natural Tango-77 from
cells and of recombinantly produced Tango-77 expressed in
host cells. Moreover, an anti-Tango-77 antibody can be
used to detect Tango-77 protein (e.g., in a cellular
lysate or cell supernatant) in order to evaluate the
abundance and pattern of expression of the Tango-77
protein. Anti-Tango-77 antibodies can be used
diagnostically to monitor protein levels in tissue as
part of a clinical testing procedure, e.g., to
determine the efficacy of a given treatment
regimen. Detection can be facilitated by coupling the
antibody to a detectable substance. Examples of
detectable substances include various enzymes, prosthetic
groups, fluorescent materials, luminescent materials,
bioluminescent materials, and radioactive materials.
Examples of suitable enzymes include horseradish
peroxidase, alkaline phosphatase, β-galactosidase, or
acetylcholinesterase; examples of suitable prosthetic
group complexes include streptavidin/biotin and
avidin/biotin; examples of suitable fluorescent materials
include umbelliferone, fluorescein, fluorescein
isothiocyanate, rhodamine, dichlorotriazinylamine
fluorescein, dansyl chloride or phycoerythrin; an example
of a luminescent material is luminol; examples of
bioluminescent materials include luciferase, luciferin,
and aequorin; and examples of suitable radioactive
materials include 125I, 131I, 35S or 3H.

III. Recombinant Expression Vectors and Host Cells 
Another aspect of the invention pertains to
vectors, preferably expression vectors, containing a
nucleic acid molecule encoding Tango-77 (or a portion
thereof). As used herein, the term "vector" refers to a
nucleic acid molecule capable of transporting another
nucleic acid to which it has been linked. One type of
vector is a "plasmid", which refers to a circular double
stranded DNA loop into which additional DNA segments can
be ligated. Another type of vector is a viral vector,
wherein additional DNA segments can be ligated into the
viral genome. Certain vectors are capable of autonomous
replication in a host cell into which they are introduced
(e.g., bacterial vectors having a bacterial origin of
replication and episomal mammalian vectors). Other
vectors (e.g., non-episomal mammalian vectors) are
integrated into the genome of a host cell upon
introduction into the host cell, and thereby are
replicated along with the host genome. Moreover, certain
vectors, expression vectors, are capable of directing the
expression of genes to which they are operably linked.
In general, expression vectors of utility in recombinant
DNA techniques are often in the form of plasmids
(vectors). However, the invention is intended to include
such other forms of expression vectors, such as viral
vectors (e.g., replication defective retroviruses,
adenoviruses and adeno-associated viruses), which serve
equivalent functions.



The recombinant expression vectors of the
invention comprise a nucleic acid of the invention in a
form suitable for expression of the nucleic acid in a
host cell, which means that the recombinant expression
vectors include one or more regulatory sequences,
selected on the basis of the host cells to be used for
expression, which are operably linked to the nucleic acid
sequence to be expressed. Within a recombinant
expression vector, "operably linked" is intended to mean
that the nucleotide sequence of interest is linked to the
regulatory sequence(s) in a manner which allows for
expression of the nucleotide sequence (e.g., in an in
vitro transcription/translation system or in a host cell
when the vector is introduced into the host cell). The
term "regulatory sequence" is intended to include
promoters, enhancers and other expression control
elements (e.g., polyadenylation signals). Such
regulatory sequences are described, for example, in
Goeddel, Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, CA (1990).
Regulatory sequences include those which direct
constitutive expression of a nucleotide sequence in many
types of host cell and those which direct expression of
the nucleotide sequence only in certain host cells (e.g.,
tissue-specific regulatory sequences). It will be
appreciated by those skilled in the art that the design
of the expression vector can depend on such factors as
the choice of the host cell to be transformed, the level
of expression of protein desired, etc. The expression
vectors of the invention can be introduced into host
cells to thereby produce proteins or peptides, including
fusion proteins or peptides, encoded by nucleic acids as
described herein (e.g., Tango-77 proteins, mutant forms
of Tango-77, fusion proteins, etc.).



The recombinant expression vectors of the
invention can be designed for expression of Tango-77 in
prokaryotic or eukaryotic cells, e.g., bacterial cells
such as E. coli, insect cells (using baculovirus
expression vectors), yeast cells or mammalian cells.
Suitable host cells are discussed further in Goeddel,
Gene Expression Technology: Methods in Enzymology 185,
Academic Press, San Diego, CA (1990). Alternatively, the
recombinant expression vector can be transcribed and
translated in vitro, for example using T7 promoter
regulatory sequences and T7 polymerase.

Expression of proteins in prokaryotes is most
often carried out in E. coli with vectors containing
constitutive or inducible promoters directing the
expression of either fusion or non-fusion proteins.
Fusion vectors add a number of amino acids to a protein
encoded therein, usually to the amino terminus of the
recombinant protein. Such fusion vectors typically serve
three purposes: 1) to increase expression of recombinant
protein; 2) to increase the solubility of the recombinant
protein; and 3) to aid in the purification of the
recombinant protein by acting as a ligand in affinity
purification. Often, in fusion expression vectors, a
proteolytic cleavage site is introduced at the junction
of the fusion moiety and the recombinant protein to
enable separation of the recombinant protein from the
fusion moiety subsequent to purification of the fusion
protein. Such enzymes, and their cognate recognition
sequences, include Factor Xa, thrombin and enterokinase.
Typical fusion expression vectors include pGEX (Pharmacia
Biotech Inc; Smith and Johnson (1988) Gene 67:31-40),
pMAL (New England Biolabs, Beverly, MA) and pRIT5
(Pharmacia, Piscataway, NJ) which fuse glutathione
S-transferase (GST), maltose E binding protein, or protein
A, respectively, to the target recombinant protein.

Examples of suitable inducible non-fusion E. coli
expression vectors include pTrc (Amann et al. (1988) Gene
69:301-315) and pET 11d (Studier et al., Gene Expression
Technology: Methods in Enzymology 185, Academic Press,
San Diego, California (1990) 60-89). Target gene
expression from the pTrc vector relies on host RNA
polymerase transcription from a hybrid trp-lac fusion
promoter. Target gene expression from the pET 11d vector
relies on transcription from a T7 gn10-lac fusion
promoter mediated by a coexpressed viral RNA polymerase
(T7 gn1). This viral polymerase is supplied by host
strains BL21(DE3) or HMS174(DE3) from a resident λ
prophage harboring a T7 gn1 gene under the
transcriptional control of the lacUV5 promoter.

One strategy to maximize recombinant protein
expression in E. coli is to express the protein in a host
bacteria with an impaired capacity to proteolytically
cleave the recombinant protein (Gottesman, Gene
Expression Technology: Methods in Enzymology 185,
Academic Press, San Diego, California (1990) 119-128).
Another strategy is to alter the nucleic acid sequence of
the nucleic acid to be inserted into an expression vector
so that the individual codons for each amino acid are
those preferentially utilized in E. coli (Wada et al.
(1992) Nucleic Acids Res. 20:2111-2118). Such alteration
of nucleic acid sequences of the invention can be carried
out by standard DNA synthesis techniques.
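
The codon-substitution strategy can be sketched as a simple lookup that
writes each amino acid with one chosen codon. The small table below is an
illustrative assumption covering only a few residues; the codon assignments
are valid under the standard genetic code, but which codon is actually
preferred should be taken from a measured usage table for the intended host
strain rather than from this sketch. The function name and the example
peptide are hypothetical.

```python
# Illustrative subset only: one codon per residue, assumed to be a
# frequently used E. coli codon. Replace with a measured usage table.
ASSUMED_PREFERRED_CODONS = {
    "M": "ATG", "W": "TGG", "K": "AAA", "E": "GAA",
    "L": "CTG", "A": "GCG", "G": "GGC", "R": "CGT",
}

def back_translate(protein: str, table=ASSUMED_PREFERRED_CODONS) -> str:
    """Return a coding sequence that uses one chosen codon per residue."""
    protein = protein.upper()
    missing = sorted(set(protein) - set(table))
    if missing:
        # Refuse to guess codons for residues outside the table.
        raise ValueError(f"no codon listed for: {', '.join(missing)}")
    return "".join(table[residue] for residue in protein)

# Hypothetical five-residue peptide, not a Tango-77 sequence.
coding = back_translate("MKLAG")
```

Because the amino acid sequence is unchanged, the recoded nucleic acid
encodes the same protein; only the synonymous codon choice differs.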

In another embodiment, the Tango-77 expression
vector is a yeast expression vector. Examples of vectors
for expression in yeast S. cerevisiae include pYepSec1
(Baldari et al. (1987) EMBO J. 6:229-234), pMFa (Kurjan
and Herskowitz (1982) Cell 30:933-943), pJRY88 (Schultz
et al. (1987) Gene 54:113-123), pYES2 (Invitrogen
Corporation, San Diego, CA), and picZ (Invitrogen Corp,
San Diego, CA).

Alternatively, Tango-77 can be expressed in insect
cells using baculovirus expression vectors. Baculovirus
vectors available for expression of proteins in cultured
insect cells (e.g., Sf9 cells) include the pAc series
(Smith et al. (1983) Mol. Cell Biol. 3:2156-2165) and the
pVL series (Luckow and Summers (1989) Virology
170:31-39).

In yet another embodiment, a nucleic acid of the
invention is expressed in mammalian cells using a
mammalian expression vector. Examples of mammalian
expression vectors include pCDM8 (Seed (1987) Nature
329:840) and pMT2PC (Kaufman et al. (1987) EMBO J.
6:187-195). When used in mammalian cells, the expression
vector's control functions are often provided by viral
regulatory elements. For example, commonly used
promoters are derived from polyoma, Adenovirus 2,
cytomegalovirus and Simian Virus 40. For other suitable
expression systems for both prokaryotic and eukaryotic
cells see chapters 16 and 17 of Sambrook et al. (supra).

In another embodiment, the recombinant mammalian
expression vector is capable of directing expression of
the nucleic acid preferentially in a particular cell type
(e.g., tissue-specific regulatory elements are used to
express the nucleic acid). Tissue-specific regulatory
elements are known in the art. Non-limiting examples of
suitable tissue-specific promoters include the albumin
promoter (liver-specific; Pinkert et al. (1987) Genes
Dev. 1:268-277), lymphoid-specific promoters (Calame and
Eaton (1988) Adv. Immunol. 43:235-275), in particular
promoters of T cell receptors (Winoto and Baltimore
(1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et
al. (1983) Cell 33:729-740; Queen and Baltimore (1983)
Cell 33:741-748), neuron-specific promoters (e.g., the
neurofilament promoter; Byrne and Ruddle (1989) Proc.
Natl. Acad. Sci. USA 86:5473-5477), pancreas-specific
promoters (Edlund et al. (1985) Science 230:912-916), and
mammary gland-specific promoters (e.g., milk whey
promoter; U.S. Patent No. 4,873,316 and European
Application Publication No. 264,166). Developmentally-
regulated promoters are also encompassed, for example the
murine hox promoters (Kessel and Gruss (1990) Science
249:374-379) and the α-fetoprotein promoter (Camper and
Tilghman (1989) Genes Dev. 3:537-546).



The invention further provides a recombinant
expression vector comprising a DNA molecule of the
invention cloned into the expression vector in an
antisense orientation. That is, the DNA molecule is
operably linked to a regulatory sequence in a manner
which allows for expression (by transcription of the DNA
molecule) of an RNA molecule which is antisense to
Tango-77 mRNA. Regulatory sequences operably linked to a
nucleic acid cloned in the antisense orientation can be
chosen which direct the continuous expression of the
antisense RNA molecule in a variety of cell types, for
instance viral promoters and/or enhancers, or regulatory
sequences can be chosen which direct constitutive, tissue
specific or cell type specific expression of antisense
RNA. The antisense expression vector can be in the form
of a recombinant plasmid, phagemid or attenuated virus in
which antisense nucleic acids are produced under the
control of a high efficiency regulatory region, the
activity of which can be determined by the cell type into
which the vector is introduced. For a discussion of the
regulation of gene expression using antisense genes see
Weintraub et al. (Reviews - Trends in Genetics, Vol. 1(1)
1986).

Another aspect of the invention pertains to host
cells into which a recombinant expression vector of the
invention has been introduced. The terms "host cell" and
"recombinant host cell" are used interchangeably herein.
It is understood that such terms refer not only to the
particular subject cell but to the progeny or potential
progeny of such a cell. Because certain modifications
may occur in succeeding generations due to either
mutation or environmental influences, such progeny may
not, in fact, be identical to the parent cell, but are
still included within the scope of the term as used
herein.

A host cell can be any prokaryotic or eukaryotic
cell. For example, Tango-77 protein can be expressed in
bacterial cells such as E. coli, insect cells, yeast or
mammalian cells (such as Chinese hamster ovary cells
(CHO) or COS cells). Other suitable host cells are known
to those skilled in the art.

Vector DNA can be introduced into prokaryotic or
eukaryotic cells via conventional transformation or
transfection techniques. As used herein, the terms
"transformation" and "transfection" are intended to refer
to a variety of art-recognized techniques for introducing
foreign nucleic acid (e.g., DNA) into a host cell,
including calcium phosphate or calcium chloride
co-precipitation, DEAE-dextran-mediated transfection,
lipofection, or electroporation. Suitable methods for
transforming or transfecting host cells can be found in
Sambrook, et al. (supra), and other laboratory manuals.

For stable transfection of mammalian cells, it is
known that, depending upon the expression vector and
transfection technique used, only a small fraction of
cells may integrate the foreign DNA into their genome.
In order to identify and select these integrants, a gene
that encodes a selectable marker (e.g., for resistance to
antibiotics) is generally introduced into the host cells
along with the gene of interest. Preferred selectable
markers include those which confer resistance to drugs,
such as G418, hygromycin and methotrexate. Nucleic acid
encoding a selectable marker can be introduced into a
host cell on the same vector as that encoding Tango-77 or
can be introduced on a separate vector. Cells stably
transfected with the introduced nucleic acid can be
identified by drug selection (e.g., cells that have
incorporated the selectable marker gene will survive,
while the other cells die).

A host cell of the invention, such as a
prokaryotic or eukaryotic host cell in culture, can be
used to produce (i.e., express) Tango-77 protein.
Accordingly, the invention further provides methods for
producing Tango-77 protein using the host cells of the
invention. In one embodiment, the method comprises
culturing the host cell of the invention (into which a
recombinant expression vector encoding Tango-77 has been
introduced) in a suitable medium such that Tango-77
protein is produced. In another embodiment, the method
further comprises isolating Tango-77 from the medium or
the host cell.

The host cells of the invention can also be used
to produce nonhuman transgenic animals. For example, in
one embodiment, a host cell of the invention is a
fertilized oocyte or an embryonic stem cell into which
Tango-77-coding sequences have been introduced. Such
host cells can then be used to create non-human
transgenic animals in which exogenous Tango-77 sequences
have been introduced into their genome or homologous
recombinant animals in which endogenous Tango-77
sequences have been altered. Such animals are useful for
studying the function and/or activity of Tango-77 and for
identifying and/or evaluating modulators of Tango-77
activity. As used herein, a "transgenic animal" is a
non-human animal, preferably a mammal, more preferably a
rodent such as a rat or mouse, in which one or more of
the cells of the animal includes a transgene. Other
examples of transgenic animals include non-human
primates, sheep, dogs, cows, goats, chickens, amphibians,
etc. A transgene is exogenous DNA which is integrated
into the genome of a cell from which a transgenic animal
develops and which remains in the genome of the mature
animal, thereby directing the expression of an encoded
gene product in one or more cell types or tissues of the
transgenic animal. As used herein, an "homologous
recombinant animal" is a non-human animal, preferably a
mammal, more preferably a mouse, in which an endogenous
Tango-77 gene has been altered by homologous
recombination between the endogenous gene and an
exogenous DNA molecule introduced into a cell of the
animal, e.g., an embryonic cell of the animal, prior to
development of the animal.



A transgenic animal of the invention can be
created by introducing Tango-77-encoding nucleic acid
into the male pronuclei of a fertilized oocyte, e.g., by
microinjection or retroviral infection, and allowing the
oocyte to develop in a pseudopregnant female foster
animal. The Tango-77 cDNA sequence, e.g., that of SEQ ID
NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ ID NO:10 or the cDNA
of ATCC 98807, can be introduced as a transgene into the
genome of a non-human animal. Alternatively, a nonhuman
homologue of the human Tango-77 gene, such as a mouse
Tango-77 gene, can be isolated based on hybridization to
the human Tango-77 cDNA and used as a transgene.
Intronic sequences and polyadenylation signals can also
be included in the transgene to increase the efficiency
of expression of the transgene. A tissue-specific
regulatory sequence(s) can be operably linked to the
Tango-77 transgene to direct expression of Tango-77
protein to particular cells. Methods for generating
transgenic animals via embryo manipulation and
microinjection, particularly animals such as mice, have
become conventional in the art and are described, for
example, in U.S. Patent Nos. 4,736,866 and 4,870,009,
U.S. Patent No. 4,873,191 and in Hogan, Manipulating the
Mouse Embryo (Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, N.Y., 1986). Similar methods are used for
production of other transgenic animals. A transgenic
founder animal can be identified based upon the presence
of the Tango-77 transgene in its genome and/or expression
of Tango-77 mRNA in tissues or cells of the animals. A
transgenic founder animal can then be used to breed
additional animals carrying the transgene. Moreover,
transgenic animals carrying a transgene encoding Tango-77
can further be bred to other transgenic animals carrying
other transgenes.



To create an homologous recombinant animal, a
vector is prepared which contains at least a portion of a
Tango-77 gene (e.g., a human or a non-human homolog of
the Tango-77 gene, e.g., a murine Tango-77 gene) into
which a deletion, addition or substitution has been
introduced to thereby alter, e.g., functionally disrupt,
the Tango-77 gene. In a preferred embodiment, the vector
is designed such that, upon homologous recombination, the
endogenous Tango-77 gene is functionally disrupted (i.e.,
no longer encodes a functional protein; also referred to
as a "knock out" vector). Alternatively, the vector
can be designed such that, upon homologous recombination,
the endogenous Tango-77 gene is mutated or otherwise
altered but still encodes functional protein (e.g., the
upstream regulatory region can be altered to thereby
alter the expression of the endogenous Tango-77 protein).
In the homologous recombination vector, the altered
portion of the Tango-77 gene is flanked at its 5' and 3'
ends by additional nucleic acid of the Tango-77 gene to
allow for homologous recombination to occur between the
exogenous Tango-77 gene carried by the vector and an
endogenous Tango-77 gene in an embryonic stem cell. The
additional flanking Tango-77 nucleic acid is of
sufficient length for successful homologous recombination
with the endogenous gene. Typically, several kilobases
of flanking DNA (both at the 5' and 3' ends) are included
in the vector (see, e.g., Thomas and Capecchi (1987) Cell
51:503 for a description of homologous recombination
vectors). The vector is introduced into an embryonic
stem cell line (e.g., by electroporation) and cells in
which the introduced Tango-77 gene has homologously
recombined with the endogenous Tango-77 gene are selected
(see, e.g., Li et al. (1992) Cell 69:915). The selected
cells are then injected into a blastocyst of an animal
(e.g., a mouse) to form aggregation chimeras (see, e.g.,
Bradley in Teratocarcinomas and Embryonic Stem Cells: A
Practical Approach, Robertson, ed. (IRL, Oxford, 1987)
pp. 113-152). A chimeric embryo can then be implanted
into a suitable pseudopregnant female foster animal and
the embryo brought to term. Progeny harboring the
homologously recombined DNA in their germ cells can be
used to breed animals in which all cells of the animal
contain the homologously recombined DNA by germline
transmission of the transgene. Methods for constructing
homologous recombination vectors and homologous
recombinant animals are described further in Bradley
(1991) Current Opinion in Bio/Technology 2:823-829 and in
PCT Publication Nos. WO 90/11354, WO 91/01140, WO
92/0968, and WO 93/04169.

In another embodiment, transgenic non-human
animals can be produced which contain selected systems
which allow for regulated expression of the transgene.
One example of such a system is the cre/loxP recombinase
system of bacteriophage P1. For a description of the
cre/loxP recombinase system, see, e.g., Lakso et al.
(1992) Proc. Natl. Acad. Sci. USA 89:6232-6236. Another
example of a recombinase system is the FLP recombinase
system of Saccharomyces cerevisiae (O'Gorman et al.
(1991) Science 251:1351-1355). If a cre/loxP recombinase
system is used to regulate expression of the transgene,
animals containing transgenes encoding both the Cre
recombinase and a selected protein are required. Such
animals can be provided through the construction of
"double" transgenic animals, e.g., by mating two
transgenic animals, one containing a transgene encoding a
selected protein and the other containing a transgene
encoding a recombinase.

Clones of the non-human transgenic animals
described herein can also be produced according to the
methods described in Wilmut et al. (1997) Nature
385:810-813 and PCT Publication Nos. WO 97/07668 and
WO 97/07669. In brief, a cell, e.g., a somatic cell, from
the transgenic animal can be isolated and induced to exit
the growth cycle and enter G0 phase. The quiescent cell
can then be fused, e.g., through the use of electrical
pulses, to an enucleated oocyte from an animal of the
same species from which the quiescent cell is isolated.
The reconstructed oocyte is then cultured such that it
develops to morula or blastocyst and then transferred to
a pseudopregnant female foster animal. The offspring
borne of this female foster animal will be a clone of the
animal from which the cell, e.g., the somatic cell, is
isolated.

IV. Pharmaceutical Compositions

The Tango-77 nucleic acid molecules, Tango-77
proteins, and anti-Tango-77 antibodies (also referred to
herein as "active compounds") of the invention can be
incorporated into pharmaceutical compositions suitable
for administration. Such compositions typically comprise
the nucleic acid molecule, protein, or antibody and a
pharmaceutically acceptable carrier. As used herein the
language "pharmaceutically acceptable carrier" is
intended to include any and all solvents, dispersion
media, coatings, antibacterial and antifungal agents,
isotonic and absorption delaying agents, and the like,
compatible with pharmaceutical administration. The use
of such media and agents for pharmaceutically active
substances is well known in the art. Except insofar as
any conventional media or agent is incompatible with the
active compound, use thereof in the compositions is
contemplated. Supplementary active compounds can also be
incorporated into the compositions.

A pharmaceutical composition of the invention is
formulated to be compatible with its intended route of
administration. Examples of routes of administration
include parenteral (e.g., intravenous, intradermal,
subcutaneous), oral (e.g., inhalation), transdermal
(topical), transmucosal, and rectal administration.
Solutions or suspensions used for parenteral,
intradermal, or subcutaneous application can include the
following components: a sterile diluent such as water for
injection, saline solution, fixed oils, polyethylene
glycols, glycerine, propylene glycol or other synthetic
solvents; antibacterial agents such as benzyl alcohol or
methyl parabens; antioxidants such as ascorbic acid or
sodium bisulfite; chelating agents such as
ethylenediaminetetraacetic acid; buffers such as
acetates, citrates or phosphates; and agents for the
adjustment of tonicity such as sodium chloride or
dextrose. pH can be adjusted with acids or bases, such
as hydrochloric acid or sodium hydroxide. The parenteral
preparation can be enclosed in ampoules, disposable
syringes or multiple dose vials made of glass or plastic.



Pharmaceutical compositions suitable for
injectable use include sterile aqueous solutions (where
water soluble) or dispersions and sterile powders for the
extemporaneous preparation of sterile injectable
solutions or dispersions. For intravenous
administration, suitable carriers include physiological
saline, bacteriostatic water, Cremophor EL™ (BASF;
Parsippany, NJ) or phosphate buffered saline (PBS). In
all cases, the composition must be sterile and should be
fluid to the extent that easy syringability exists. It
must be stable under the conditions of manufacture and
storage and must be preserved against the contaminating
action of microorganisms such as bacteria and fungi. The
carrier can be a solvent or dispersion medium containing,
for example, water, ethanol, polyol (for example,
glycerol, propylene glycol, and liquid polyethylene
glycol, and the like), and suitable mixtures thereof.
The proper fluidity can be maintained, for example, by
the use of a coating such as lecithin, by the maintenance
of the required particle size in the case of dispersion
and by the use of surfactants. Prevention of the action
of microorganisms can be achieved by various
antibacterial and antifungal agents, for example,
parabens, chlorobutanol, phenol, ascorbic acid,
thimerosal, and the like. In many cases, it will be
preferable to include isotonic agents, for example,
sugars, polyalcohols such as mannitol and sorbitol, or
sodium chloride in the composition. Prolonged absorption
of the injectable compositions can be brought about by
including in the composition an agent which delays
absorption, for example, aluminum monostearate and
gelatin.

Sterile injectable solutions can be prepared by
incorporating the active compound (e.g., a Tango-77
protein or anti-Tango-77 antibody) in the required amount
in an appropriate solvent with one or a combination of
ingredients enumerated above, as required, followed by
filtered sterilization. Generally, dispersions are
prepared by incorporating the active compound into a
sterile vehicle which contains a basic dispersion medium
and the required other ingredients from those enumerated
above. In the case of sterile powders for the
preparation of sterile injectable solutions, the
preferred methods of preparation are vacuum drying and
freeze-drying, which yield a powder of the active
ingredient plus any additional desired ingredient from a
previously sterile-filtered solution thereof.

Oral compositions generally include an inert
diluent or an edible carrier. They can be enclosed in
gelatin capsules or compressed into tablets. For the
purpose of oral therapeutic administration, the active
compound can be incorporated with excipients and used in
the form of tablets, troches, or capsules. Oral
compositions can also be prepared using a fluid carrier
for use as a mouthwash, wherein the compound in the fluid
carrier is applied orally and swished and expectorated or
swallowed. Pharmaceutically compatible binding agents,
and/or adjuvant materials can be included as part of the
composition. The tablets, pills, capsules, troches and
the like can contain any of the following ingredients, or
compounds of a similar nature: a binder such as
microcrystalline cellulose, gum tragacanth or gelatin; an
excipient such as starch or lactose; a disintegrating
agent such as alginic acid, Primogel, or corn starch; a
lubricant such as magnesium stearate or Sterotes; a
glidant such as colloidal silicon dioxide; a sweetening
agent such as sucrose or saccharin; or a flavoring agent
such as peppermint, methyl salicylate, or orange
flavoring.

For administration by inhalation, the compounds
are delivered in the form of an aerosol spray from a
pressurized container or dispenser which contains a
suitable propellant, e.g., a gas such as carbon dioxide,
or a nebulizer.

Systemic administration can also be by
transmucosal or transdermal means. For transmucosal or
transdermal administration, penetrants appropriate to the
barrier to be permeated are used in the formulation.
Such penetrants are generally known in the art, and
include, for example, for transmucosal administration,
detergents, bile salts, and fusidic acid derivatives.
Transmucosal administration can be accomplished through
the use of nasal sprays or suppositories. For
transdermal administration, the active compounds are
formulated into ointments, salves, gels, or creams as
generally known in the art.

The compounds can also be prepared in the form of
suppositories (e.g., with conventional suppository bases
such as cocoa butter and other glycerides) or retention
enemas for rectal delivery.

In one embodiment, the active compounds are
prepared with carriers that will protect the compound
against rapid elimination from the body, such as a
controlled release formulation, including implants and
microencapsulated delivery systems. Biodegradable,
biocompatible polymers can be used, such as ethylene
vinyl acetate, polyanhydrides, polyglycolic acid,
collagen, polyorthoesters, and polylactic acid. Methods
for preparation of such formulations will be apparent to
those skilled in the art. The materials can also be
obtained commercially from Alza Corporation and Nova
Pharmaceuticals, Inc. Liposomal suspensions (including
liposomes targeted to infected cells with monoclonal
antibodies to viral antigens) can also be used as
pharmaceutically acceptable carriers. These can be
prepared according to methods known to those skilled in
the art, for example, as described in U.S. Patent No.
4,522,811.

It is especially advantageous to formulate oral or
parenteral compositions in dosage unit form for ease of
administration and uniformity of dosage. Dosage unit
form as used herein refers to physically discrete units
suited as unitary dosages for the subject to be treated,
each unit containing a predetermined quantity of active
compound calculated to produce the desired therapeutic
effect in association with the required pharmaceutical
carrier. The specifications for the dosage unit forms of
the invention are dictated by and directly dependent on
the unique characteristics of the active compound and the
particular therapeutic effect to be achieved, and the
limitations inherent in the art of compounding such an
active compound for the treatment of individuals.

The nucleic acid molecules of the invention can be
inserted into vectors and used as gene therapy vectors.
Gene therapy vectors can be delivered to a subject by,
for example, intravenous injection, local administration
(U.S. Patent 5,328,470) or by stereotactic injection
(see, e.g., Chen et al. (1994) Proc. Natl. Acad. Sci. USA
91:3054-3057). The pharmaceutical preparation of the
gene therapy vector can include the gene therapy vector
in an acceptable diluent, or can comprise a slow release
matrix in which the gene delivery vehicle is imbedded.
Alternatively, where the complete gene delivery vector
can be produced intact from recombinant cells, e.g.,
retroviral vectors, the pharmaceutical preparation can
include one or more cells which produce the gene delivery
system.

The pharmaceutical compositions can be included in
a container, pack, or dispenser together with
instructions for administration.

V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein
homologues, and antibodies described herein can be used
in one or more of the following methods: a) screening
assays; b) detection assays (e.g., chromosomal mapping,
tissue typing, forensic biology); c) predictive medicine
(e.g., diagnostic assays, prognostic assays, monitoring
clinical trials, and pharmacogenomics); and d) methods of
treatment (e.g., therapeutic and prophylactic). A
Tango-77 protein interacts with other cellular proteins
and can thus be used for regulation of inflammation. The
polypeptides of the invention can be used in assays to
determine biological activity. For example, they could
be used in a panel of proteins for high-throughput
screening.

The isolated nucleic acid molecules of the
invention can be used to express Tango-77 protein (e.g.,
via a recombinant expression vector in a host cell in
gene therapy applications), to detect Tango-77 mRNA
(e.g., in a biological sample) or a genetic lesion in a
Tango-77 gene, and to modulate Tango-77 activity. In
addition, the Tango-77 proteins can be used to screen
drugs or compounds which modulate the Tango-77 activity
or expression as well as to treat disorders characterized
by insufficient or excessive production of Tango-77
protein or production of Tango-77 protein forms which
have decreased or aberrant activity compared to Tango-77
wild type protein. In addition, the anti-Tango-77
antibodies of the invention can be used to detect and
isolate Tango-77 proteins and modulate Tango-77 activity.

This invention further pertains to novel agents
identified by the above-described screening assays and
uses thereof for treatments as described herein.

A. Screening Assays 

The invention provides a method (also referred to
herein as a "screening assay") for identifying
modulators, i.e., candidate or test compounds or agents
(e.g., peptides, peptidomimetics, small molecules or
other drugs) which bind to Tango-77 proteins or have a
stimulatory or inhibitory effect on, for example,
Tango-77 expression or Tango-77 activity.

Examples of methods for the synthesis of molecular
libraries can be found in the art, for example in:
DeWitt et al. (1993) Proc. Natl. Acad. Sci. USA 90:6909;
Erb et al. (1994) Proc. Natl. Acad. Sci. USA 91:11422;
Zuckermann et al. (1994) J. Med. Chem. 37:2678; Cho et
al. (1993) Science 261:1303; Carell et al. (1994) Angew.
Chem. Int. Ed. Engl. 33:2059; Carell et al. (1994) Angew.
Chem. Int. Ed. Engl. 33:2061; and Gallop et al. (1994) J.
Med. Chem. 37:1233.

Libraries of compounds may be presented in
solution (e.g., Houghten (1992) Bio/Techniques
13:412-421), or on beads (Lam (1991) Nature 354:82-84),
chips (Fodor (1993) Nature 364:555-556), bacteria (U.S.
Patent No. 5,223,409), spores (Patent Nos. 5,571,698;
5,403,484; and 5,223,409), plasmids (Cull et al. (1992)
Proc. Natl. Acad. Sci. USA 89:1865-1869) or phage (Scott
and Smith (1990) Science 249:386-390; Devlin (1990)
Science 249:404-406; Cwirla et al. (1990) Proc. Natl.
Acad. Sci. USA 87:6378-6382; and Felici (1991) J. Mol.
Biol. 222:301-310).

In another embodiment, an assay is used to
determine the ability of the test compound to modulate
the activity of Tango-77 or a biologically active portion
thereof, for example, by determining the ability of the
Tango-77 protein to bind to or interact with a Tango-77
target molecule. As used herein, a "target molecule" is
a molecule with which a Tango-77 protein binds or
interacts in nature, for example, a molecule on the
surface of a cell. A Tango-77 target molecule can be a
non-Tango-77 molecule or a Tango-77 protein or
polypeptide of the present invention. In one embodiment,
a Tango-77 target molecule is a component of a signal
transduction pathway, for example, Tango-77 may bind to
an IL-1 receptor or another receptor, thereby blocking
the receptor and inhibiting future signal transduction.
Determining the ability of the Tango-77 protein to bind
to or interact with a Tango-77 target molecule can be
accomplished by one of the methods described above. In a
preferred embodiment, determining the ability of the
Tango-77 protein to bind to or interact with a Tango-77
target molecule can be accomplished by determining the
activity of the target molecule. For example, the
activity of the target molecule can be determined by
detecting induction of a cellular second messenger of the
target (e.g., intracellular Ca2+, diacylglycerol, IP3,
etc.), detecting catalytic/enzymatic activity of the
target on an appropriate substrate, detecting the
induction of a reporter gene (e.g., a Tango-77-responsive
regulatory element operably linked to a nucleic acid
encoding a detectable marker, e.g., luciferase), or
detecting a cellular response, for example, inflammation.

In yet another embodiment, an assay of the present
invention is a cell-free assay comprising contacting a
Tango-77 protein or biologically active portion thereof
with a test compound and determining the ability of the
test compound to bind to the Tango-77 protein or
biologically active portion thereof. Binding of the test
compound to the Tango-77 protein can be determined either
directly or indirectly as described above. In a
preferred embodiment, the assay includes contacting the
Tango-77 protein or biologically active portion thereof
with a known compound which binds Tango-77 to form an
assay mixture, contacting the assay mixture with a test
compound, and determining the ability of the test
compound to interact with a Tango-77 protein, wherein
determining the ability of the test compound to interact
with a Tango-77 protein comprises determining the ability
of the test compound to preferentially bind to Tango-77
or biologically active portion thereof as compared to the
known compound.
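
One way to read out the competition format described above is to compare the signal from a labeled known compound measured with and without the test compound. The following is a minimal sketch under stated assumptions: the known compound carries a detectable label, bound signal is reported in arbitrary units, and a fixed displacement cutoff is used to call a hit. The function names, the background-subtraction step, and the 50% threshold are illustrative and are not taken from the specification.

```python
def percent_displacement(signal_without_test, signal_with_test, background=0.0):
    """Percent of known-compound binding lost when the test compound is added.

    All signals are in the same arbitrary units; `background` is the signal
    measured with no Tango-77 protein present (an assumed control).
    """
    baseline = signal_without_test - background
    if baseline <= 0:
        raise ValueError("baseline signal must exceed background")
    remaining = signal_with_test - background
    return 100.0 * (baseline - remaining) / baseline


def preferentially_binds(signal_without_test, signal_with_test,
                         background=0.0, threshold_pct=50.0):
    """Call a test compound a competitor if it displaces at least
    `threshold_pct` percent of the labeled known compound (assumed cutoff)."""
    displaced = percent_displacement(signal_without_test, signal_with_test, background)
    return displaced >= threshold_pct
```

In practice the cutoff would be chosen from control wells and replicate variability rather than fixed in advance.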

In another embodiment, an assay is a cell-free
assay comprising contacting Tango-77 protein or
biologically active portion thereof with a test compound
and determining the ability of the test compound to
modulate (e.g., stimulate or inhibit) the activity of the
Tango-77 protein or biologically active portion thereof.
Determining the ability of the test compound to modulate
the activity of Tango-77 can be accomplished, for
example, by determining the ability of the Tango-77
protein to bind to a Tango-77 target molecule by one of
the methods described above for determining direct
binding. In an alternative embodiment, determining the
ability of the test compound to modulate the activity of
Tango-77 can be accomplished by determining the ability
of the Tango-77 protein to further modulate a Tango-77
target molecule. For example, the catalytic/enzymatic
activity of the target molecule on an appropriate
substrate can be determined as previously described.

In yet another embodiment, the cell-free assay
comprises contacting the Tango-77 protein or biologically
active portion thereof with a known compound which binds
Tango-77 to form an assay mixture, contacting the assay
mixture with a test compound, and determining the ability
of the test compound to interact with a Tango-77 protein,
wherein determining the ability of the test compound to
interact with a Tango-77 protein comprises determining
the ability of the Tango-77 protein to preferentially
bind to or modulate the activity of a Tango-77 target
molecule.

It is possible that membrane-bound forms of
Tango-77 exist. The cell-free assays of the present
invention are amenable to use of both forms of Tango-77.
In the case of cell-free assays comprising a
membrane-bound form of Tango-77, it may be desirable to
utilize a solubilizing agent such that the membrane-bound
form of Tango-77 is maintained in solution. Examples of
such solubilizing agents include non-ionic detergents
such as n-octylglucoside, n-dodecylglucoside,
n-dodecylmaltoside, octanoyl-N-methylglucamide,
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-114,
Thesit®, Isotridecypoly(ethylene glycol ether)n,
3-[(3-cholamidopropyl)dimethylammonio]-1-propane
sulfonate (CHAPS),
3-[(3-cholamidopropyl)dimethylammonio]-2-hydroxy-1-propane
sulfonate (CHAPSO), or
N-dodecyl-N,N-dimethyl-3-ammonio-1-propane sulfonate.



In more than one embodiment of the above assay
methods of the present invention, it may be desirable to
immobilize either Tango-77 or its target molecule to
facilitate separation of complexed from uncomplexed forms
of one or both of the proteins, as well as to accommodate
automation of the assay. Binding of a test compound to
Tango-77, or interaction of Tango-77 with a target
molecule in the presence and absence of a candidate
compound, can be accomplished in any vessel suitable for
containing the reactants. Examples of such vessels
include microtitre plates, test tubes, and
micro-centrifuge tubes. In one embodiment, a fusion
protein can be provided which adds a domain that allows
one or both of the proteins to be bound to a matrix. For
example, glutathione-S-transferase/Tango-77 fusion
proteins or glutathione-S-transferase/target fusion
proteins can be adsorbed onto glutathione sepharose beads
(Sigma Chemical Co.; St. Louis, MO) or glutathione
derivatized microtitre plates, which are then combined
with the test compound or the test compound and either
the non-adsorbed target protein or Tango-77 protein, and
the mixture incubated under conditions conducive to
complex formation (e.g., at physiological conditions for
salt and pH). Following incubation, the beads or
microtitre plate wells are washed to remove any unbound
components and complex formation is measured either
directly or indirectly, for example, as described above.
Alternatively, the complexes can be dissociated from the
matrix, and the level of Tango-77 binding or activity
determined using standard techniques.



Other techniques for immobilizing proteins on
matrices can also be used in the screening assays of the
invention. For example, either Tango-77 or its target
molecule can be immobilized utilizing conjugation of
biotin and streptavidin. Biotinylated Tango-77 or target
molecules can be prepared from biotin-NHS
(N-hydroxy-succinimide) using techniques well known in
the art (e.g., biotinylation kit, Pierce Chemicals;
Rockford, IL), and immobilized in the wells of
streptavidin-coated 96 well plates (Pierce Chemical).
Alternatively, antibodies reactive with Tango-77 or
target molecules but which do not interfere with binding
of the Tango-77 protein to its target molecule can be
derivatized to the wells of the plate, and unbound target
or Tango-77 trapped in the wells by antibody conjugation.
Methods for detecting such complexes, in addition to
those described above for the GST-immobilized complexes,
include immunodetection of complexes using antibodies
reactive with the Tango-77 or target molecule, as well as
enzyme-linked assays which rely on detecting an enzymatic
activity associated with the Tango-77 or target molecule.



In another embodiment, modulators of Tango-77
expression are identified in a method in which a cell is
contacted with a candidate compound and the expression of
Tango-77 mRNA or protein in the cell is determined. The
level of expression of Tango-77 mRNA or protein in the
presence of the candidate compound is compared to the
level of expression of Tango-77 mRNA or protein in the
absence of the candidate compound. The candidate
compound can then be identified as a modulator of
Tango-77 expression based on this comparison. For
example, when expression of Tango-77 mRNA or protein is
greater (statistically significantly greater) in the
presence of the candidate compound than in its absence,
the candidate compound is identified as a stimulator of
Tango-77 mRNA or protein expression. Alternatively, when
expression of Tango-77 mRNA or protein is less
(statistically significantly less) in the presence of the
candidate compound than in its absence, the candidate
compound is identified as an inhibitor of Tango-77 mRNA
or protein expression. The level of Tango-77 mRNA or
protein expression in the cells can be determined by
methods described herein for detecting Tango-77 mRNA or
protein.

In yet another aspect of the invention, the
Tango-77 proteins can be used as "bait proteins" in a
two-hybrid assay or three-hybrid assay (see, e.g., U.S.
Patent No. 5,283,317; Zervos et al. (1993) Cell
72:223-232; Madura et al. (1993) J. Biol. Chem.
268:12046-12054; Bartel et al. (1993) Bio/Techniques
14:920-924; Iwabuchi et al. (1993) Oncogene 8:1693-1696;
and PCT Publication No. WO 94/10300), to identify other
proteins which bind to or interact with Tango-77
("Tango-77-binding proteins" or "Tango-77-bp") and
modulate Tango-77 activity. Such Tango-77-binding
proteins are also likely to be involved in the
propagation of signals by the Tango-77 proteins as, for
example, upstream or downstream elements of the
Tango-77 pathway.
The two-hybrid system is based on the modular
nature of most transcription factors, which consist of
separable DNA-binding and activation domains. Briefly,
the assay utilizes two different DNA constructs. In one
construct, the gene that codes for Tango-77 is fused to a
gene encoding the DNA-binding domain of a known
transcription factor (e.g., GAL-4). In the other
construct, a DNA sequence, from a library of DNA
sequences, that encodes an unidentified protein ("prey"
or "sample") is fused to a gene that codes for the
activation domain of the known transcription factor. If
the "bait" and the "prey" proteins are able to interact,
in vivo, forming a Tango-77-dependent complex, the
DNA-binding and activation domains of the transcription
factor are brought into close proximity. This proximity
allows transcription of a reporter gene (e.g., LacZ)
which is operably linked to a transcriptional regulatory
site responsive to the transcription factor. Expression
of the reporter gene can be detected and cell colonies
containing the functional transcription factor can be
isolated and used to obtain the cloned gene which encodes
the protein which interacts with Tango-77.

This invention further pertains to novel agents 
identified by the above-described screening assays and 
uses thereof for treatments as described herein. 
B. Detection Assays

Portions or fragments of the cDNA sequence
identified herein (and the corresponding complete gene
sequences) can be used in numerous ways as polynucleotide
reagents. For example, the sequence can be used to: (i)
map the respective gene on a chromosome and, thus, locate
gene regions associated with genetic disease; (ii)
identify an individual from a minute biological sample
(tissue typing); and (iii) aid in forensic identification
of a biological sample. These applications are described
in the subsections below.

1. Chromosome Mapping

Once the sequence (or a portion of the sequence)
of a gene has been isolated, this sequence can be used to
map the location of the gene on a chromosome.
Accordingly, Tango-77 nucleic acid molecules described
herein, or fragments thereof, can be used to map the
location of the Tango-77 gene(s) on a chromosome. The
mapping of the Tango-77 sequences to chromosomes is an
important first step in correlating these sequences with
genes associated with disease.

Briefly, a Tango-77 gene can be mapped to
chromosomes by preparing PCR primers (preferably 15-25 bp
in length) from the Tango-77 sequences. Computer
analysis of Tango-77 sequences can be used to rapidly
select primers that do not span more than one exon in the
genomic DNA, since primers spanning more than one exon
would complicate the amplification process.
These primers can then be used for PCR screening of
somatic cell hybrids containing individual human
chromosomes. Only those hybrids containing the human
gene corresponding to the Tango-77 sequences will yield
an amplified fragment.

Somatic cell hybrids are prepared by fusing
somatic cells from different mammals (e.g., human and
mouse cells). As hybrids of human and mouse cells grow
and divide, they gradually lose human chromosomes in
random order, but retain the mouse chromosomes. By using
media in which mouse cells cannot grow (because they lack
a particular enzyme) but in which human cells can, the
one human chromosome that contains the gene encoding the
needed enzyme will be retained. By using various media,
panels of hybrid cell lines can be established. Each
cell line in a panel contains either a single human
chromosome or a small number of human chromosomes, and a
full set of mouse chromosomes, allowing easy mapping of
individual genes to specific human chromosomes
(D'Eustachio et al. (1983) Science 220:919-924). Somatic
cell hybrids containing only fragments of human
chromosomes can also be produced by using human
chromosomes with translocations and deletions.
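
The logic of assigning a gene to a chromosome from a hybrid
panel can be sketched as a concordance check: the gene is
assigned to the chromosome whose presence or absence across
the panel matches the pattern of PCR-positive and
PCR-negative hybrids. The panel composition, hybrid names
and PCR results below are invented for illustration and are
not mapping data for Tango-77.

```python
def concordant_chromosomes(panel, pcr_positive):
    """Return chromosomes whose retention pattern matches the PCR results.

    panel:        dict mapping hybrid name -> set of human chromosomes retained
    pcr_positive: dict mapping hybrid name -> True if a fragment was amplified
    """
    all_chromosomes = set().union(*panel.values())
    matches = []
    for chrom in sorted(all_chromosomes):
        # A chromosome is concordant only if it is present in every
        # PCR-positive hybrid and absent from every PCR-negative hybrid.
        if all((chrom in retained) == pcr_positive[hybrid]
               for hybrid, retained in panel.items()):
            matches.append(chrom)
    return matches


# Hypothetical panel of four hybrid cell lines.
panel = {
    "hybrid_A": {"2", "7"},
    "hybrid_B": {"7", "X"},
    "hybrid_C": {"2"},
    "hybrid_D": {"11", "X"},
}
pcr_positive = {
    "hybrid_A": True,
    "hybrid_B": True,
    "hybrid_C": False,
    "hybrid_D": False,
}
print(concordant_chromosomes(panel, pcr_positive))  # ['7']
```

If more than one chromosome is returned, the panel does not
distinguish between them and further hybrids would be needed.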



PCR mapping of somatic cell hybrids is a rapid
procedure for assigning a particular sequence to a
particular chromosome. Three or more sequences can be
assigned per day using a single thermal cycler. Using
the Tango-77 sequences to design oligonucleotide primers,
sublocalization can be achieved with panels of fragments
from specific chromosomes. Other mapping strategies
which can similarly be used to map a Tango-77 sequence to
its chromosome include in situ hybridization (described
in Fan et al. (1990) Proc. Natl. Acad. Sci. USA
87:6223-27), pre-screening with labeled flow-sorted
chromosomes, and pre-selection by hybridization to
chromosome-specific cDNA libraries.

Fluorescence in situ hybridization (FISH) of a DNA
sequence to a metaphase chromosomal spread can further be
used to provide a precise chromosomal location in one
step. Chromosome spreads can be made using cells whose
division has been blocked in metaphase by a chemical,
e.g., colcemid, that disrupts the mitotic spindle. The
chromosomes can be treated briefly with trypsin, and then
stained with Giemsa. A pattern of light and dark bands
develops on each chromosome, so that the chromosomes can
be identified individually. The FISH technique can be
used with a DNA sequence as short as 500 or 600 bases.
However, clones larger than 1,000 bases have a higher
likelihood of binding to a unique chromosomal location
with sufficient signal intensity for simple detection.
Preferably 1,000 bases, and more preferably 2,000 bases,
will suffice to get good results in a reasonable amount
of time. For a review of this technique, see Verma et
al. (Human Chromosomes: A Manual of Basic Techniques
(Pergamon Press, New York, 1988)).

Reagents for chromosome mapping can be used
individually to mark a single chromosome or a single site
on that chromosome, or panels of reagents can be used for
marking multiple sites and/or multiple chromosomes.
Reagents corresponding to noncoding regions of the genes
actually are preferred for mapping purposes. Coding
sequences are more likely to be conserved within gene
families, thus increasing the chance of
cross-hybridization during chromosomal mapping.

Once a sequence has been mapped to a precise
chromosomal location, the physical position of the
sequence on the chromosome can be correlated with genetic
map data. (Such data are found, for example, in V.
McKusick, Mendelian Inheritance in Man, available on-line
through Johns Hopkins University Welch Medical Library.)
The relationship between genes and disease, mapped to the
same chromosomal region, can then be identified through
linkage analysis (co-inheritance of physically adjacent
genes), described in, e.g., Egeland et al. (1987) Nature
325:783-787.

Moreover, differences in the DNA sequences between
individuals affected and unaffected with a disease
associated with the Tango-77 gene can be determined. If
a mutation is observed in some or all of the affected
individuals but not in any unaffected individuals, then
the mutation is likely to be the causative agent of the
particular disease. Comparison of affected and
unaffected individuals generally involves first looking
for structural alterations in the chromosomes, such as
deletions or translocations, that are visible from
chromosome spreads or detectable using PCR based on that
DNA sequence. Ultimately, complete sequencing of genes
from several individuals can be performed to confirm the
presence of a mutation and to distinguish mutations from
polymorphisms.

2. Tissue Typing

The Tango-77 sequences of the present invention
can also be used to identify individuals from minute
biological samples. The United States military, for
example, is considering the use of restriction fragment
length polymorphism (RFLP) for identification of its
personnel. In this technique, an individual's genomic
DNA is digested with one or more restriction enzymes, and
probed on a Southern blot to yield unique bands for
identification. This method does not suffer from the
current limitations of "Dog Tags", which can be lost,
switched, or stolen, making positive identification
difficult. The sequences of the present invention are
useful as additional DNA markers for RFLP (described in
U.S. Patent 5,272,057).

Furthermore, the sequences of the present
invention can be used to provide an alternative technique
which determines the actual base-by-base DNA sequence of
selected portions of an individual's genome. Thus, the
Tango-77 sequences described herein can be used to
prepare two PCR primers from the 5' and 3' ends of the
sequences. These primers can then be used to amplify an
individual's DNA and subsequently sequence it.

Panels of corresponding DNA sequences from
individuals, prepared in this manner, can provide unique
individual identifications, as each individual will have
a unique set of such DNA sequences due to allelic
differences. The sequences of the present invention can
be used to obtain such identification sequences from
individuals and from tissue. The Tango-77 sequences of
the invention uniquely represent portions of the human
genome. Allelic variation occurs to some degree in the
coding regions of these sequences, and to a greater
degree in the noncoding regions. It is estimated that
allelic variation between individual humans occurs with a
frequency of about once per 500 bases. Each of the
sequences described herein can, to some degree, be used
as a standard against which DNA from an individual can be
compared for identification purposes. Because greater
numbers of polymorphisms occur in the noncoding regions,
fewer sequences are necessary to differentiate
individuals. The noncoding sequences of SEQ ID NO:1 can
comfortably provide positive individual identification
with a panel of perhaps 10 to 1,000 primers which each
yield a noncoding amplified sequence of 100 bases. If
predicted coding sequences, such as those in SEQ ID NO:3,
SEQ ID NO:6, or SEQ ID NO:10 are used, a more appropriate
number of primers for positive individual identification
would be 500-2,000.
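
As a rough worked example of the figures above, the number of
variable positions a primer panel might be expected to cover
can be estimated by multiplying the number of amplified
sequences by their length and dividing by the stated
spacing of allelic variation. This is only back-of-envelope
arithmetic: it applies the overall estimate of one variation
per 500 bases uniformly, whereas the text notes that
noncoding regions vary more than coding regions, and the
function name is an assumption of this sketch.

```python
def expected_variant_sites(n_primers, amplicon_length=100, bases_per_variant=500):
    """Expected number of allelic differences covered by a primer panel.

    Assumes each primer yields one amplified sequence of `amplicon_length`
    bases and that variation occurs about once per `bases_per_variant` bases.
    """
    return n_primers * amplicon_length / bases_per_variant


# Panel sizes taken from the range quoted in the text.
for n_primers in (10, 1000):
    print(n_primers, expected_variant_sites(n_primers))
# 10 2.0
# 1000 200.0
```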

If a panel of reagents from Tango-77 sequences
described herein is used to generate a unique
identification database for an individual, those same
reagents can later be used to identify tissue from that
individual. Using the unique identification database,
positive identification of the individual, living or
dead, can be made from extremely small tissue samples.



WO 99/06426 



PCT/US98/16102 




3. Use of Partial Tango-77 Sequences in Forensic Biology

DNA-based identification techniques can also be
used in forensic biology. Forensic biology is a
scientific field employing genetic typing of biological
evidence found at a crime scene as a means for positively
identifying, for example, a perpetrator of a crime. To
make such an identification, PCR technology can be used
to amplify DNA sequences taken from very small biological
samples such as tissues, e.g., hair or skin, or body
fluids, e.g., blood, saliva, or semen found at a crime
scene. The amplified sequence can then be compared to a
standard, thereby allowing identification of the origin
of the biological sample.

The sequences of the present invention can be used
to provide polynucleotide reagents, e.g., PCR primers,
targeted to specific loci in the human genome, which can
enhance the reliability of DNA-based forensic
identifications by, for example, providing another
"identification marker" (i.e., another DNA sequence that
is unique to a particular individual). As mentioned
above, actual base sequence information can be used for
identification as an accurate alternative to patterns
formed by restriction enzyme generated fragments.
Sequences targeted to noncoding regions of SEQ ID NO:1
are particularly appropriate for this use as greater
numbers of polymorphisms occur in the noncoding regions,
making it easier to differentiate individuals using this
technique. Examples of polynucleotide reagents include
the Tango-77 sequences or portions thereof, e.g.,
fragments derived from the noncoding regions of SEQ ID
NO:1 having a length of at least 20 or 30 bases.

The Tango-77 sequences described herein can
further be used to provide polynucleotide reagents, e.g.,
labeled or labelable probes which can be used in, for
example, an in situ hybridization technique, to identify
a specific tissue, e.g., brain tissue. This can be very
useful in cases where a forensic pathologist is presented
with a tissue of unknown origin. Panels of such Tango-77
probes can be used to identify tissue by species and/or
by organ type.

In a similar fashion, these reagents, e.g.,
Tango-77 primers or probes, can be used to screen tissue
culture for contamination (i.e., screen for the presence
of a mixture of different types of cells in a culture).

C. Predictive Medicine

The present invention also pertains to the field
of predictive medicine in which diagnostic assays,
prognostic assays, pharmacogenomics, and monitoring
clinical trials are used for prognostic (predictive)
purposes to thereby treat an individual prophylactically.
Accordingly, one aspect of the present invention relates
to diagnostic assays for determining Tango-77 protein
and/or nucleic acid expression as well as Tango-77
activity, in the context of a biological sample (e.g.,
blood, serum, cells, tissue) to thereby determine whether
an individual is afflicted with a disease or disorder, or
is at risk of developing a disorder, associated with
aberrant Tango-77 expression or activity. The invention
also provides for prognostic (or predictive) assays for
determining whether an individual is at risk of
developing a disorder associated with Tango-77 protein,
nucleic acid expression or activity. For example,
mutations in a Tango-77 gene can be assayed in a
biological sample. Such assays can be used for
prognostic or predictive purposes to thereby
prophylactically treat an individual prior to the onset
of a disorder characterized by or associated with
Tango-77 protein, nucleic acid expression or activity.

Another aspect of the invention provides methods
for determining Tango-77 protein, nucleic acid expression
or Tango-77 activity in an individual to thereby select
appropriate therapeutic or prophylactic agents for that
individual (referred to herein as "pharmacogenomics").
Pharmacogenomics allows for the selection of agents
(e.g., drugs) for therapeutic or prophylactic treatment
of an individual based on the genotype of the individual
(e.g., the genotype of the individual examined to
determine the ability of the individual to respond to a
particular agent).

Yet another aspect of the invention pertains to
monitoring the influence of agents (e.g., drugs or other
compounds) on the expression or activity of Tango-77 in
clinical trials.

These and other agents are described in further 
detail in the following sections. 

1. Diagnostic Assays

An exemplary method for detecting the presence or
absence of Tango-77 in a biological sample involves
obtaining a biological sample from a test subject and
contacting the biological sample with a compound or an
agent capable of detecting Tango-77 protein or nucleic
acid (e.g., mRNA, genomic DNA) that encodes Tango-77
protein such that the presence of Tango-77 is detected in
the biological sample. A preferred agent for detecting
Tango-77 mRNA or genomic DNA is a labeled nucleic acid
probe capable of hybridizing to Tango-77 mRNA or genomic
DNA. The nucleic acid probe can be, for example, a
full-length Tango-77 nucleic acid, such as the nucleic
acid of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ ID
NO:10 or a portion thereof, such as an oligonucleotide of
at least 15, 30, 50, 100, 250 or 500 nucleotides in length
and sufficient to specifically hybridize under stringent
conditions to Tango-77 mRNA or genomic DNA. Other
suitable probes for use in the diagnostic assays of the
invention are described herein.

A preferred agent for detecting Tango-77 protein
is an antibody capable of binding to Tango-77 protein,
preferably an antibody with a detectable label.
Antibodies can be polyclonal, or more preferably,
monoclonal. An intact antibody, or a fragment thereof
(e.g., Fab or F(ab')2) can be used. The term "labeled",
with regard to the probe or antibody, is intended to
encompass direct labeling of the probe or antibody by
coupling (i.e., physically linking) a detectable
substance to the probe or antibody, as well as indirect
labeling of the probe or antibody by reactivity with
another reagent that is directly labeled. Examples of
indirect labeling include detection of a primary antibody
using a fluorescently labeled secondary antibody and
end-labeling of a DNA probe with biotin such that it can
be detected with fluorescently labeled streptavidin. The
term "biological sample" is intended to include tissues,
cells and biological fluids isolated from a subject, as
well as tissues, cells and fluids present within a
subject. That is, the detection method of the invention
can be used to detect Tango-77 mRNA, protein, or genomic
DNA in a biological sample in vitro as well as in vivo.
For example, in vitro techniques for detection of
Tango-77 mRNA include Northern hybridizations and in situ
hybridizations. In vitro techniques for detection of
Tango-77 protein include enzyme linked immunosorbent
assays (ELISAs), Western blots, immunoprecipitations and
immunofluorescence. In vitro techniques for detection of
Tango-77 genomic DNA include Southern hybridizations.
Furthermore, in vivo techniques for detection of Tango-77
protein include introducing into a subject a labeled
anti-Tango-77 antibody. For example, the antibody can be
labeled with a radioactive marker whose presence and
location in a subject can be detected by standard imaging
techniques.

In one embodiment, the biological sample contains
protein molecules from the test subject. Alternatively,
the biological sample can contain mRNA molecules from the
test subject or genomic DNA molecules from the test
subject. A preferred biological sample is a peripheral
blood leukocyte sample isolated by conventional means
from a subject.

In another embodiment, the methods further involve
obtaining a control biological sample from a control
subject, contacting the control sample with a compound or
agent capable of detecting Tango-77 protein, mRNA, or
genomic DNA, such that the presence of Tango-77 protein,
mRNA or genomic DNA is detected in the biological sample,
and comparing the presence of Tango-77 protein, mRNA or
genomic DNA in the control sample with the presence of
Tango-77 protein, mRNA or genomic DNA in the test sample.

The invention also encompasses kits for detecting
the presence of Tango-77 in a biological sample (a test
sample). Such kits can be used to determine if a subject
is suffering from or is at increased risk of developing a
disorder associated with aberrant expression of Tango-77
(e.g., an immunological disorder). For example, the kit
can comprise a labeled compound or agent capable of
detecting Tango-77 protein or mRNA in a biological sample
and means for determining the amount of Tango-77 in the
sample (e.g., an anti-Tango-77 antibody or an
oligonucleotide probe which binds to DNA encoding
Tango-77, e.g., SEQ ID NO:1 or SEQ ID NO:3 or SEQ ID
NO:6, or SEQ ID NO:10). Kits may also include
instructions for observing that the tested subject is
suffering from or is at risk of developing a disorder
associated with aberrant expression of Tango-77 if the
amount of Tango-77 protein or mRNA is above or below a
normal level.

For antibody-based kits, the kit may comprise, for
example: (1) a first antibody (e.g., attached to a solid
support) which binds to Tango-77 protein; and, optionally,
(2) a second, different antibody which binds to Tango-77
protein or the first antibody and is conjugated to a
detectable agent.

For oligonucleotide-based kits, the kit may
comprise, for example: (1) an oligonucleotide, e.g., a
detectably labelled oligonucleotide, which hybridizes to
a Tango-77 nucleic acid sequence or (2) a pair of primers
useful for amplifying a Tango-77 nucleic acid molecule.
The kit may also comprise, e.g., a buffering
agent, a preservative, or a protein stabilizing agent.
The kit may also comprise components necessary for
detecting the detectable agent (e.g., an enzyme or a
substrate). The kit may also contain a control sample or
a series of control samples which can be assayed and
compared to the test sample contained. Each component of
the kit is usually enclosed within an individual
container and all of the various containers are within a
single package along with instructions for observing
whether the tested subject is suffering from or is at
risk of developing a disorder associated with aberrant
expression of Tango-77.

2. Prognostic Assays

The methods described herein can furthermore be
utilized as diagnostic or prognostic assays to identify
subjects having or at risk of developing a disease or
disorder associated with aberrant Tango-77 expression or
activity. For example, the assays described herein, such
as the preceding diagnostic assays or the following
assays, can be utilized to identify a subject having or
at risk of developing a disorder associated with aberrant
expression or activity. Thus, the present invention
provides a method in which a test sample is obtained from
a subject and Tango-77 protein or nucleic acid (e.g.,
mRNA, genomic DNA) is detected, wherein the presence of
Tango-77 protein or nucleic acid is diagnostic for a
subject having or at risk of developing a disease or
disorder associated with aberrant Tango-77 expression or
activity. As used herein, a "test sample" refers to a
biological sample obtained from a subject of interest.
For example, a test sample can be a biological fluid
(e.g., serum), cell sample, or tissue.

Furthermore, the prognostic assays described
herein can be used to determine whether a subject can be
administered an agent (e.g., an agonist, antagonist,
peptidomimetic, protein, peptide, nucleic acid, small
molecule, or other drug candidate) to treat a disease or
disorder associated with aberrant Tango-77 expression or
activity. For example, such methods can be used to
determine whether a subject can be effectively treated
with a specific agent or class of agents (e.g., agents of
a type which decrease Tango-77 activity). Thus, the
present invention provides methods for determining
whether a subject can be effectively treated with an
agent for a disorder associated with aberrant Tango-77
expression or activity in which a test sample is obtained
and Tango-77 protein or nucleic acid is detected (e.g.,
wherein the presence of Tango-77 protein or nucleic acid
is diagnostic for a subject that can be administered the
agent to treat a disorder associated with aberrant
Tango-77 expression or activity).

The methods of the invention can also be used to
detect genetic lesions or mutations in a Tango-77 gene,
thereby determining if a subject with the lesioned gene
is at risk for a disorder characterized by aberrant
inflammation. In preferred embodiments, the methods
include detecting, in a sample of cells from the subject,
the presence or absence of a genetic lesion or mutation
characterized by at least one of an alteration affecting
the integrity of a gene encoding a Tango-77 protein, or
the mis-expression of the Tango-77 gene. For example,
such genetic lesions or mutations can be detected by
ascertaining the existence of at least one of: 1) a
deletion of one or more nucleotides from a Tango-77 gene;
2) an addition of one or more nucleotides to a Tango-77
gene; 3) a substitution of one or more nucleotides of a
Tango-77 gene; 4) a chromosomal rearrangement of a
Tango-77 gene; 5) an alteration in the level of a
messenger RNA transcript of a Tango-77 gene; 6) an
aberrant modification of a Tango-77 gene, such as of the
methylation pattern of the genomic DNA; 7) the presence
of a non-wild type splicing pattern of a messenger RNA
transcript of a Tango-77 gene; 8) a non-wild type level
of a Tango-77 protein; 9) an allelic loss of a Tango-77
gene; and 10) an inappropriate post-translational
modification of a Tango-77 protein. As described herein,
there are a large number of assay techniques known in the
art which can be used for detecting lesions or mutations
in a Tango-77 gene. A preferred biological sample is a
peripheral blood leukocyte sample isolated by
conventional means from a subject.



In certain embodiments, detection of the lesion
involves the use of a probe/primer in a polymerase chain
reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and
4,683,202), such as anchor PCR or RACE PCR, or,
alternatively, in a ligation chain reaction (LCR) (see,
e.g., Landegren et al. (1988) Science 241:1077-1080; and
Nakazawa et al. (1994) Proc. Natl. Acad. Sci. USA
91:360-364), the latter of which can be particularly
useful for detecting point mutations in the Tango-77 gene
(see, e.g., Abravaya et al. (1995) Nucleic Acids Res.
23:675-682). This method can include the steps of
collecting a sample of cells from a patient, isolating
nucleic acid (e.g., genomic, mRNA or both) from the cells
of the sample, contacting the nucleic acid sample with one
or more primers which specifically hybridize to a Tango-77
gene under conditions such that hybridization and
amplification of the Tango-77 gene (if present) occurs,
and detecting the presence or absence of an amplification
product, or detecting the size of the amplification
product and comparing the length to a control sample. It
is anticipated that PCR and/or LCR may be desirable to
use as a preliminary amplification step in conjunction
with any of the techniques used for detecting mutations
described herein.

Alternative amplification methods include: self
sustained sequence replication (Guatelli et al. (1990)
Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional
amplification system (Kwoh et al. (1989) Proc. Natl.
Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi
et al. (1988) Bio/Technology 6:1197), or any other
nucleic acid amplification method, followed by the
detection of the amplified molecules using techniques
well known to those of skill in the art. These detection
schemes are especially useful for the detection of
nucleic acid molecules if such molecules are present in
very low numbers.



In an alternative embodiment, mutations in a
Tango-77 gene from a sample cell can be identified by
alterations in restriction enzyme cleavage patterns. For
example, sample and control DNA is isolated, amplified
(optionally), digested with one or more restriction
endonucleases, and fragment length sizes are determined
by gel electrophoresis and compared. Differences in
fragment length sizes between sample and control DNA
indicate mutations in the sample DNA. Moreover,
sequence-specific ribozymes (see, e.g., U.S. Patent
No. 5,498,531) can be used to score for the presence of
specific mutations by development or loss of a ribozyme
cleavage site.
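
The fragment-length comparison can be sketched in silico as
follows. The example assumes a single enzyme, EcoRI, which
recognizes GAATTC and cuts after the first G on the strand
shown; the two short sequences are invented for illustration
and are not Tango-77 sequences, and only one strand of a
linear molecule is modeled.

```python
def fragment_lengths(sequence, site="GAATTC", cut_offset=1):
    """Return sorted fragment lengths from digesting a linear sequence.

    site:       recognition sequence (default is the EcoRI site)
    cut_offset: number of bases into the site at which the strand is cut
    """
    sequence = sequence.upper()
    cuts = []
    start = sequence.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = sequence.find(site, start + 1)
    bounds = [0] + cuts + [len(sequence)]
    return sorted(end - begin for begin, end in zip(bounds, bounds[1:]))


# Hypothetical control DNA with two EcoRI sites.
control = "ATGC" * 3 + "GAATTC" + "ATGC" * 5 + "GAATTC" + "ATGC" * 2
# Hypothetical sample DNA in which a point mutation destroys the second site.
sample = "ATGC" * 3 + "GAATTC" + "ATGC" * 5 + "GAGTTC" + "ATGC" * 2

print(fragment_lengths(control))  # [13, 13, 26]
print(fragment_lengths(sample))   # [13, 39]
print(fragment_lengths(control) != fragment_lengths(sample))  # True
```

A difference between the two lists corresponds to the
altered banding pattern that would be seen on a gel. A
mutation outside any recognition site leaves the pattern
unchanged, so an identical pattern does not rule out a
mutation.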

In other embodiments, genetic mutations in
Tango-77 can be identified by hybridizing a sample and
control nucleic acids, e.g., DNA or RNA, to high density
arrays containing hundreds or thousands of
oligonucleotide probes (Cronin et al. (1996) Human
Mutation 7:244-255; Kozal et al. (1996) Nature Medicine
2:753-759). For example, genetic mutations in Tango-77
can be identified in two-dimensional arrays containing
light-generated DNA probes as described in Cronin et al.,
supra. Briefly, a first hybridization array of probes
can be used to scan through long stretches of DNA in a
sample and control to identify base changes between the
sequences by making linear arrays of sequential
overlapping probes. This step allows the identification
of point mutations. This step is followed by a second
hybridization array that allows the characterization of
specific mutations by using smaller, specialized probe
arrays complementary to all variants or mutations
detected. Each mutation array is composed of parallel
probe sets, one complementary to the wild-type gene and
the other complementary to the mutant gene.
In yet another embodiment, any of a variety of
sequencing reactions known in the art can be used to
directly sequence the Tango-77 gene and detect mutations
by comparing the sequence of the sample Tango-77 with the
corresponding wild-type (control) sequence. Examples of
sequencing reactions include those based on techniques
developed by Maxam and Gilbert ((1977) Proc. Natl. Acad.
Sci. USA 74:560) or Sanger ((1977) Proc. Natl. Acad.
Sci. USA 74:5463). It is also contemplated that any of a
variety of automated sequencing procedures can be
utilized when performing the diagnostic assays ((1995)
Bio/Techniques 19:448), including sequencing by mass
spectrometry (see, e.g., PCT Publication No. WO 94/16101;
Cohen et al. (1996) Adv. Chromatogr. 36:127-162; and
Griffin et al. (1993) Appl. Biochem. Biotechnol.
38:147-159).
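
Once sample and wild-type sequences are in hand, the
comparison itself can be sketched as a position-by-position
scan. The sketch below assumes the two sequences are already
aligned and of equal length, so it reports substitutions
only; insertions and deletions would require an alignment
step that is not shown. The sequences and the function name
are hypothetical.

```python
def find_substitutions(wild_type, sample):
    """List (position, wild_type_base, sample_base) for each mismatch.

    Positions are 1-based. Both sequences must have the same length.
    """
    if len(wild_type) != len(sample):
        raise ValueError("sequences differ in length; align them first")
    return [(index + 1, wt_base, sample_base)
            for index, (wt_base, sample_base)
            in enumerate(zip(wild_type.upper(), sample.upper()))
            if wt_base != sample_base]


# Hypothetical wild-type (control) and sample reads.
wild_type = "ATGGCCTTAG"
sample = "ATGGACTTAG"
print(find_substitutions(wild_type, sample))  # [(5, 'C', 'A')]
```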

Other methods for detecting mutations in the
Tango-77 gene include methods in which protection from
cleavage agents is used to detect mismatched bases in
RNA/RNA or RNA/DNA heteroduplexes (Myers et al. (1985)
Science 230:1242). In general, the technique of
"mismatch cleavage" entails providing heteroduplexes
formed by hybridizing (labeled) RNA or DNA containing the
wild-type Tango-77 sequence with potentially mutant RNA
or DNA obtained from a tissue sample. The
double-stranded duplexes are treated with an agent which
cleaves single-stranded regions of the duplex, such as
those which exist due to base pair mismatches between the
control and sample strands. RNA/DNA duplexes can be
treated with RNase to digest mismatched regions, and
DNA/DNA hybrids can be treated with S1 nuclease to digest
mismatched regions. In other embodiments, either DNA/DNA
or RNA/DNA duplexes can be treated with hydroxylamine or
osmium tetroxide and with piperidine in order to digest
mismatched regions. After digestion of the mismatched
regions, the resulting material is then separated by size
on denaturing polyacrylamide gels to determine the site
of mutation. See, e.g., Cotton et al. (1988) Proc. Natl.
Acad. Sci. USA 85:4397; Saleeba et al. (1992) Methods
Enzymol. 217:286-295. In a preferred embodiment, the
control DNA or RNA can be labeled for detection.

In still another embodiment, the mismatch cleavage
reaction employs one or more proteins that recognize
mismatched base pairs in double-stranded DNA (so called
"DNA mismatch repair" enzymes) in defined systems for
detecting and mapping point mutations in Tango-77 cDNAs
obtained from samples of cells. For example, the mutY
enzyme of E. coli cleaves A at G/A mismatches and the
thymidine DNA glycosylase from HeLa cells cleaves T at
G/T mismatches (Hsu et al. (1994) Carcinogenesis
15:1657-1662). According to an exemplary embodiment, a
probe based on a Tango-77 sequence, e.g., a wild-type
Tango-77 sequence, is hybridized to a cDNA or other DNA
product from a test cell(s). The duplex is treated with a
DNA mismatch repair enzyme, and the cleavage products, if
any, can be detected from electrophoresis protocols or
the like. See, e.g., U.S. Patent No. 5,459,039.



In other embodiments, alterations in
electrophoretic mobility will be used to identify
mutations in Tango-77 genes. For example, single strand
conformation polymorphism (SSCP) may be used to detect
differences in electrophoretic mobility between mutant
and wild type nucleic acids (Orita et al. (1989) Proc.
Natl. Acad. Sci. USA 86:2766; see also Cotton (1993)
Mutat. Res. 285:125-144; Hayashi (1992) Genet Anal Tech
Appl 9:73-79). Single-stranded DNA fragments of sample
and control Tango-77 nucleic acids will be denatured and
allowed to renature. The secondary structure of
single-stranded nucleic acids varies according to
sequence, and the resulting alteration in electrophoretic
mobility enables the detection of even a single base
change. The DNA fragments may be labeled or detected with
labeled probes. The sensitivity of the assay may be
enhanced by using RNA (rather than DNA), in which the
secondary structure is more sensitive to a change in
sequence. In a preferred embodiment, the subject method
utilizes heteroduplex analysis to separate double
stranded heteroduplex molecules on the basis of changes
in electrophoretic mobility (Keen et al. (1991) Trends
Genet 7:5).

In yet another embodiment, the movement of mutant
or wild-type fragments in polyacrylamide gels containing
a gradient of denaturant is assayed using denaturing
gradient gel electrophoresis (DGGE) (Myers et al. (1985)
Nature 313:495). When DGGE is used as the method of
analysis, DNA will be modified to ensure that it does not
completely denature, for example by adding a GC clamp of
approximately 40 bp of high-melting GC-rich DNA by PCR.
In a further embodiment, a temperature gradient is used
in place of a denaturing gradient to identify differences
in the mobility of control and sample DNA (Rosenbaum and
Reissner (1987) Biophys. Chem. 265:12753).

Examples of other techniques for detecting point
mutations include, but are not limited to, selective
oligonucleotide hybridization, selective amplification,
or selective primer extension. For example,
oligonucleotide primers may be prepared in which the
known mutation is placed centrally and then hybridized to
target DNA under conditions which permit hybridization
only if a perfect match is found (Saiki et al. (1986)
Nature 324:163; Saiki et al. (1989) Proc. Natl. Acad.
Sci. USA 86:6230). Such allele specific oligonucleotides
are hybridized to PCR amplified target DNA or a number of
different mutations when the oligonucleotides are
attached to the hybridizing membrane and hybridized with
labeled target DNA.

Alternatively, allele specific amplification
technology which depends on selective PCR amplification
may be used in conjunction with the instant invention.
Oligonucleotides used as primers for specific
amplification may carry the mutation of interest in the
center of the molecule (so that amplification depends on
differential hybridization) (Gibbs et al. (1989) Nucleic
Acids Res. 17:2437-2448) or at the extreme 3' end of one
primer where, under appropriate conditions, mismatch can
prevent or reduce polymerase extension (Prossner (1993)
Tibtech 11:238). In addition, it may be desirable to
introduce a novel restriction site in the region of the
mutation to create cleavage-based detection (Gasparini et
al. (1992) Mol. Cell Probes 6:1). It is anticipated that
in certain embodiments amplification may also be
performed using Taq ligase for amplification (Barany
(1991) Proc. Natl. Acad. Sci USA 88:189). In such cases,
ligation will occur only if there is a perfect match at
the 3' end of the 5' sequence making it possible to
detect the presence of a known mutation at a specific
site by looking for the presence or absence of
amplification.
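The dependence of allele specific amplification on the 3'-terminal base of a primer can be sketched in Python as follows. This is a deliberately simplified model, not a description of any particular assay: it assumes that extension occurs only when the primer's 3'-terminal base matches the target and it ignores internal mismatches, reaction conditions, and polymerase tolerance. The function name and the ten-base sequences are hypothetical.

```python
def primer_extends(primer: str, target_sense: str, start: int) -> bool:
    """Predict extension for an allele specific primer.

    `primer` and `target_sense` are written 5'->3' in the same sense, so a
    perfectly annealed primer reads identically to the target window that
    begins at 0-based index `start`.
    Simplifying assumption: extension is blocked whenever the 3'-terminal
    base of the primer is mismatched; internal mismatches are ignored.
    """
    window = target_sense[start:start + len(primer)]
    if len(window) < len(primer):
        raise ValueError("primer runs past the end of the target")
    return primer[-1].upper() == window[-1].upper()


# Hypothetical alleles differing at a single base (index 8, A -> T).
wild_type = "ACGTTGCAAG"
mutant = "ACGTTGCATG"
mutant_primer = "GTTGCAT"  # 3' end sits on the variant base

print(primer_extends(mutant_primer, mutant, 2))     # 3' base matches
print(primer_extends(mutant_primer, wild_type, 2))  # 3' base mismatched
```

Under this model, the presence or absence of a product with the mutant-specific primer distinguishes the two alleles.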

The methods described herein may be performed, for
example, by utilizing pre-packaged diagnostic kits
comprising at least one probe nucleic acid or antibody
reagent described herein, which may be conveniently used,
e.g., in clinical settings to diagnose patients
exhibiting symptoms or family history of a disease or
illness involving a Tango-77 gene.

Furthermore, any cell type or tissue, preferably
peripheral blood leukocytes, in which Tango-77 is
expressed may be utilized in the prognostic assays
described herein.

3. Pharmacogenomics

Agents, or modulators, which have a stimulatory or
inhibitory effect on Tango-77 activity (e.g., Tango-77
gene expression) as identified by a screening assay
described herein can be administered to individuals to
treat (prophylactically or therapeutically) disorders
(e.g., acute or chronic inflammation and asthma)
associated with aberrant Tango-77 activity. In
conjunction with such treatment, the pharmacogenomics
(i.e., the study of the relationship between an
individual's genotype and that individual's response to a
foreign compound or drug) of the individual may be
considered. Differences in metabolism of therapeutics
can lead to severe toxicity or therapeutic failure by
altering the relation between dose and blood
concentration of the pharmacologically active drug. Thus,
the pharmacogenomics of the individual permits the
selection of effective agents (e.g., drugs) for
prophylactic or therapeutic treatments based on a
consideration of the individual's genotype. Such
pharmacogenomics can further be used to determine
appropriate dosages and therapeutic regimens.
Accordingly, the activity of Tango-77 protein, expression
of Tango-77 nucleic acid, or mutation content of Tango-77
genes in an individual can be determined to thereby
select appropriate agent(s) for therapeutic or
prophylactic treatment of the individual.

Pharmacogenomics deals with clinically significant
hereditary variations in the response to drugs due to
altered drug disposition and abnormal action in affected
persons. See, e.g., Linder (1997) Clin. Chem.
43(2):254-266. In general, two types of pharmacogenetic
conditions can be differentiated. Genetic conditions
transmitted as a single factor altering the way drugs act
on the body are referred to as "altered drug action."
Genetic conditions transmitted as single factors altering
the way the body acts on drugs are referred to as
"altered drug metabolism." These pharmacogenetic
conditions can occur either as rare defects or as
polymorphisms. For example, glucose-6-phosphate
dehydrogenase deficiency (G6PD) is a common inherited
enzymopathy in which the main clinical complication is
haemolysis after ingestion of oxidant drugs
(anti-malarials, sulfonamides, analgesics, nitrofurans)
and consumption of fava beans.

As an illustrative embodiment, the activity of
drug metabolizing enzymes is a major determinant of both
the intensity and duration of drug action. The discovery
of genetic polymorphisms of drug metabolizing enzymes
(e.g., N-acetyltransferase 2 (NAT 2) and cytochrome P450
enzymes CYP2D6 and CYP2C19) has provided an explanation
as to why some patients do not obtain the expected drug
effects or show exaggerated drug response and serious
toxicity after taking the standard and safe dose of a
drug. These polymorphisms are expressed in two
phenotypes in the population, the extensive metabolizer
(EM) and poor metabolizer (PM). The prevalence of PM is
different among different populations. For example, the
gene coding for CYP2D6 is highly polymorphic and several
mutations have been identified in PM, which all lead to
the absence of functional CYP2D6. Poor metabolizers of
CYP2D6 and CYP2C19 quite frequently experience
exaggerated drug response and side effects when they
receive standard doses. If a metabolite is the active
therapeutic moiety, a PM shows no therapeutic response,
as demonstrated for the analgesic effect of codeine
mediated by its CYP2D6-formed metabolite morphine. At the
other extreme are the so called ultra-rapid metabolizers
who do not respond to standard doses. Recently, the
molecular basis of ultra-rapid metabolism has been
identified to be due to CYP2D6 gene amplification.

Thus, the activity of Tango-77 protein, expression
of Tango-77 nucleic acid, or mutation content of Tango-77
genes in an individual can be determined to thereby
select appropriate agent(s) for therapeutic or
prophylactic treatment of the individual. In addition,
pharmacogenetic studies can be used to apply genotyping
of polymorphic alleles encoding drug-metabolizing enzymes
to the identification of an individual's drug
responsiveness phenotype. This knowledge, when applied
to dosing or drug selection, can avoid adverse reactions
or therapeutic failure and thus enhance therapeutic or
prophylactic efficiency when treating a subject with a
Tango-77 modulator, such as a modulator identified by one
of the exemplary screening assays described herein.

4. Monitoring of Effects During Clinical Trials

Monitoring the influence of agents (e.g., drugs,
compounds) on the expression or activity of Tango-77
(e.g., the ability to modulate aberrant inflammation) can
be applied not only in basic drug screening, but also in
clinical trials. For example, the effectiveness of an
agent, as determined by a screening assay as described
herein, to increase Tango-77 gene expression, increase
protein levels, or upregulate Tango-77 activity, can be
monitored in clinical trials of subjects exhibiting
decreased Tango-77 gene expression, decreased protein
levels, or downregulated Tango-77 activity.
Alternatively, the effectiveness of an agent, as
determined by a screening assay, to decrease Tango-77
gene expression, decrease protein levels, or downregulate
Tango-77 activity, can be monitored in clinical trials of
subjects exhibiting increased Tango-77 gene expression,
increased protein levels, or upregulated Tango-77
activity.

For example, and not by way of limitation, genes,
including Tango-77, that are modulated in cells by
treatment with an agent (e.g., compound, drug or small
molecule) which modulates Tango-77 activity (e.g., as
identified in a screening assay described herein) can be
identified. Thus, to study the effect of agents on
cellular proliferation disorders, for example, in a
clinical trial, cells can be isolated and RNA prepared
and analyzed for the levels of expression of Tango-77 and
other genes implicated in the disorder. The levels of
gene expression (i.e., a gene expression pattern) can be
quantified by Northern blot analysis or RT-PCR, as
described herein, or alternatively by measuring the
amount of protein produced, by one of the methods as
described herein, or by measuring the levels of activity
of Tango-77 or other genes. In this way, the gene
expression pattern can serve as a marker, indicative of
the physiological response of the cells to the agent.
Accordingly, this response state may be determined
before, and at various points during, treatment of the
individual with the agent.



In a preferred embodiment, the present invention
provides a method for monitoring the effectiveness of
treatment of a subject with an agent (e.g., an agonist,
antagonist, peptidomimetic, protein, peptide, nucleic
acid, small molecule, or other drug candidate identified
by the screening assays described herein) comprising the
steps of (i) obtaining a pre-administration sample from a
subject prior to administration of the agent; (ii)
detecting the level of expression of a Tango-77 protein,
mRNA, or genomic DNA in the pre-administration sample;
(iii) obtaining one or more post-administration samples
from the subject; (iv) detecting the level of expression
or activity of the Tango-77 protein, mRNA, or genomic DNA
in the post-administration samples; (v) comparing the
level of expression or activity of the Tango-77 protein,
mRNA, or genomic DNA in the pre-administration sample
with the Tango-77 protein, mRNA, or genomic DNA in the
post-administration sample or samples; and (vi) altering
the administration of the agent to the subject
accordingly. For example, increased administration of
the agent may be desirable to increase the expression or
activity of Tango-77 to higher levels than detected,
i.e., to increase the effectiveness of the agent.
Alternatively, decreased administration of the agent may
be desirable to decrease expression or activity of
Tango-77 to lower levels than detected, i.e., to decrease
the effectiveness of the agent.
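Steps (v) and (vi) of the monitoring method, comparing pre- and post-administration levels and altering administration accordingly, can be sketched in Python. The sketch is purely illustrative: it assumes an agent intended to raise Tango-77 expression, and the target fold change, the 10% tolerance band, and all function and parameter names are hypothetical and not specified in this document.

```python
from statistics import mean


def fold_change(pre_level: float, post_levels: list[float]) -> float:
    """Mean post-administration level divided by the pre-administration level."""
    if pre_level <= 0:
        raise ValueError("pre-administration level must be positive")
    if not post_levels:
        raise ValueError("at least one post-administration sample is required")
    return mean(post_levels) / pre_level


def suggest_adjustment(pre_level: float, post_levels: list[float],
                       target_fold: float, tolerance: float = 0.10) -> str:
    """Compare the observed fold change with a hypothetical target.

    Assumption: the agent is meant to increase expression, so falling
    short of the target suggests more agent and overshooting suggests less.
    """
    observed = fold_change(pre_level, post_levels)
    if observed < target_fold * (1 - tolerance):
        return "increase administration"
    if observed > target_fold * (1 + tolerance):
        return "decrease administration"
    return "maintain administration"


# Hypothetical expression levels in arbitrary units.
print(suggest_adjustment(1.0, [1.2, 1.4], target_fold=2.0))
```

For an agent intended to lower expression, the two comparisons would be reversed.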



Methods of Treatment

The present invention provides for both
prophylactic and therapeutic methods of treating a
subject at risk of (or susceptible to) developing or
having a disorder associated with aberrant Tango-77
expression or activity. Alternatively, disorders
associated with aberrant IL-1 production can be treated
with Tango-77. Such disorders include acute and chronic
inflammation, asthma, some classes of arthritis,
autoimmune diabetes, systemic lupus erythematosus and
inflammatory bowel disease.

1. Prophylactic Methods

In one aspect, the invention provides a method for
preventing in a subject, a disease or condition
associated with an aberrant Tango-77 expression or
activity (or aberrant IL-1 expression or activity), by
administering to the subject an agent which modulates
Tango-77 expression or at least one Tango-77 activity.
Subjects at risk for a disease which is caused or
contributed to by aberrant Tango-77 expression or
activity can be identified by, for example, any or a
combination of diagnostic or prognostic assays as
described herein. Administration of a prophylactic agent
can occur prior to the manifestation of symptoms
characteristic of the Tango-77 aberrancy, such that a
disease or disorder is prevented or, alternatively,
delayed in its progression. Depending on the type of
Tango-77 aberrancy, for example, a Tango-77 agonist or
Tango-77 antagonist agent can be used for treating the
subject. The appropriate agent can be determined based
on screening assays described herein.



Therapeutic Methods

Another aspect of the invention pertains to
methods of modulating Tango-77 expression or activity for
therapeutic purposes. The modulatory method of the
invention involves contacting a cell with an agent that
modulates one or more of the activities of Tango-77
protein associated with the cell. An agent that
modulates Tango-77 protein activity can be an agent as
described herein, such as a nucleic acid or a protein, a
naturally-occurring cognate ligand of a Tango-77 protein,
a peptide, a Tango-77 peptidomimetic, or other small
molecule. In one embodiment, the agent stimulates one or
more of the biological activities of Tango-77 protein.
Examples of such stimulatory agents include active
Tango-77 protein and a nucleic acid molecule encoding
Tango-77 that has been introduced into the cell. In
another embodiment, the agent inhibits one or more of the
biological activities of Tango-77 protein. Examples of
such inhibitory agents include antisense Tango-77 nucleic
acid molecules and anti-Tango-77 antibodies. These
modulatory methods can be performed in vitro (e.g., by
culturing the cell with the agent) or, alternatively, in
vivo (e.g., by administering the agent to a subject). As
such, the present invention provides methods of treating
an individual afflicted with a disease or disorder
characterized by aberrant expression or activity of a
Tango-77 protein or nucleic acid molecule. In one
embodiment, the method involves administering an agent
(e.g., an agent identified by a screening assay described
herein), or combination of agents that modulates (e.g.,
upregulates or downregulates) Tango-77 expression or
activity. In another embodiment, the method involves
administering a Tango-77 protein or nucleic acid molecule
as therapy to compensate for reduced or aberrant Tango-77
expression or activity.

Stimulation of Tango-77 activity is desirable in
situations in which Tango-77 is abnormally downregulated
and/or in which increased Tango-77 activity is likely to
have a beneficial effect. Conversely, inhibition of
Tango-77 activity is desirable in situations in which
Tango-77 is abnormally upregulated and/or in which
decreased Tango-77 activity is likely to have a
beneficial effect.

This invention is further illustrated by the
following examples which should not be construed as
limiting. The contents of all references, patents and
published patent applications cited throughout this
application are hereby incorporated by reference.

EXAMPLES 

Example 1: Isolation and Characterization of Human 
Tango-77 cDNAs 

Cytokine genes IL-1α, IL-1β and IL-1ra have been
found to be closely clustered on chromosome 2, i.e.,
IL-1α, IL-1β and IL-1ra are located within 450 kb of each
other. BAC clones containing IL-1α and IL-1β were used
to identify other proximal unknown cytokine genes. To do
this, a BAC clone containing IL-1α and IL-1β was selected
from a BAC library (Research Genetics, Huntsville,
Alabama) using specific primers designed against IL-1α
and IL-1β. The DNA from the BAC was extracted and used
to make a random-sheared genomic library. From this BAC
library, 4000 clones were selected for sequencing. The
resulting genomic sequences were then assembled into
contigs and used to screen proprietary and public
databases. One genomic contig was found to contain two
segments of sequences which resemble IL-1ra. These two
segments are potential exons of the Tango-77 gene.

Two PCR primers were then designed from the two
potential exons and used to screen a panel of cDNA
libraries for the expression of a Tango-77 message. A
cDNA library from TNF-α-treated human lung epithelia
showed a positive band of the predicted size (i.e., if
the two exons are spliced together). Using the PCR
fragment as a probe, a single cDNA clone was isolated
from the same library. This cDNA contains an insert of
989 bp. The cDNA clone contains three possible open
reading frames. The first open reading frame encompasses
534 nucleotides (nucleotides 356-889 of SEQ ID NO:1; SEQ
ID NO:3) and encodes a 178 amino acid protein (SEQ ID
NO:2). This protein may include a predicted signal
sequence of about 63 amino acids (from amino acid 1 to
about amino acid 63 of SEQ ID NO:2 (SEQ ID NO:4)) and a
predicted mature protein of about 115 amino acids (from
about amino acid 64 to amino acid 178 of SEQ ID NO:2 (SEQ
ID NO:5)).

The second putative nucleotide open reading frame
encompasses 498 nucleotides (nucleotides 389-889 of SEQ
ID NO:1; SEQ ID NO:6) and encodes a 167 amino acid
protein (SEQ ID NO:7). This protein includes a predicted
signal sequence of about 52 amino acids (from amino acid
1 to about amino acid 52 of SEQ ID NO:7 (SEQ ID NO:8))
and a predicted mature protein of about 115 amino acids
(from about amino acid 53 to amino acid 167 of SEQ ID
NO:7 (SEQ ID NO:9)).

The third open reading frame (nucleotides 372-889
of SEQ ID NO:1; SEQ ID NO:10) encompasses 408 nucleotides
and encodes a 136 amino acid protein (SEQ ID NO:11).
This protein includes a predicted signal sequence of
about 21 amino acids (from amino acid 1 to about amino
acid 21 of SEQ ID NO:11 (SEQ ID NO:12)) and a predicted
mature protein of about 115 amino acids (from about amino
acid 22 to amino acid 136 of SEQ ID NO:11 (SEQ ID
NO:13)).
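The coordinate ranges and lengths quoted for the three open reading frames can be cross-checked with simple arithmetic. The following Python sketch is illustrative only: the function name `orf_summary` is hypothetical, and the coordinates are treated as 1-based and inclusive, which is an assumption. Under that assumption the first range (356-889) reproduces the stated 534 nucleotides and 178 codons, whereas the second and third ranges as printed give 501 and 518 nucleotides respectively. The sketch reports these values without attempting to decide which figure is intended.

```python
def orf_summary(start: int, end: int) -> tuple[int, float]:
    """Return (length in nucleotides, number of codons) for an inclusive range.

    Assumption: coordinates are 1-based and inclusive.
    """
    length = end - start + 1
    return length, length / 3


# (start, end, stated nucleotides, stated amino acids) as quoted in the text.
quoted = [
    (356, 889, 534, 178),
    (389, 889, 498, 167),
    (372, 889, 408, 136),
]

for start, end, stated_nt, stated_aa in quoted:
    length, codons = orf_summary(start, end)
    print(f"{start}-{end}: computed {length} nt ({codons:.2f} codons); "
          f"stated {stated_nt} nt, {stated_aa} aa")

# Signal sequence plus mature protein should add up to the full length.
for signal, mature, total in [(63, 115, 178), (52, 115, 167), (21, 115, 136)]:
    print(signal + mature == total)
```

The signal-sequence and mature-protein lengths add up to the stated full-length protein in all three cases.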

Tango-77 is predicted to be 35% identical to human
IL-1ra at the amino acid level.

Example 2: Expression of Tango-77 mRNA in Human Tissues 

The expression of Tango-77 was analyzed using
Northern blot hybridization. A PCR generated 989 bp
Tango-77 product was radioactively labeled with 32P-dCTP
using the Prime-It kit (Stratagene; La Jolla, CA)
according to the instructions of the supplier. Filters
containing human mRNA (MTNI and MTNII: Clontech; Palo
Alto, CA) were probed in ExpressHyb hybridization
solution (Clontech) and washed at high stringency
according to manufacturer's recommendations.

Tango-77 mRNA was not detected in mRNA from any
unstimulated tissues (brain, liver, spleen, skeletal
muscle, testis, pancreas, heart, kidney and peripheral
blood leukocytes) on Clontech Northern blots.

Over 96 cDNA libraries were then tested for the
presence of Tango-77 using PCR amplification. Only three
libraries displayed a positive signal. These libraries
were the TNFα-treated bronchoepithelium, TNFα-treated SSC
cell line and anti-CD3-treated T cells.

Example 3: Characterization of Tango-77 Proteins

In this example, the predicted amino acid sequence
of human Tango-77 protein was compared to the amino acid
sequence of known protein IL-1ra. In addition, the
molecular weight of the human Tango-77 proteins was
predicted.

The human Tango-77 cDNA (Figure 1; SEQ ID NO:1)
isolated as described above encodes a 178 amino acid
protein (Figure 1; SEQ ID NO:2) or a 167 amino acid
protein (Figure 1; SEQ ID NO:7) or a 136 amino acid
protein (Figure 1; SEQ ID NO:11). The signal peptide
prediction program SIGNALP Optimized Tool (Nielsen et al.
(1997) Protein Engineering 10:1-6) predicted that
Tango-77 includes a 63 amino acid signal peptide (amino
acid 1 to about amino acid 63 of SEQ ID NO:2 (SEQ ID
NO:4)) preceding the 115 amino acid mature protein; or
preceding the 115 amino acid mature protein (about amino
acid 52 to amino acid 167 of SEQ ID NO:7 (SEQ ID NO:8));
or preceding the 115 amino acid mature protein (about
amino acid 21 to amino acid 136 of SEQ ID NO:11; SEQ ID
NO:12).

As shown in Figure 2, Tango-77 has a region of
homology to IL-1ra (SEQ ID NO:14).

Mature Tango-77 has a predicted MW of about 13 kDa
and the predicted MW for the immature Tango-77 is 19.6
kDa, 18.5 kDa or 15.2 kDa, not including
post-translational modifications.
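The order of magnitude of these predicted molecular weights can be checked from chain length alone. The Python sketch below uses a rule-of-thumb average residue mass of about 110 Da, which is an assumption introduced here for illustration; this document does not state how its MW values were calculated, and an exact figure would require summing residue masses over the actual sequences.

```python
# Assumption: rule-of-thumb average mass per amino acid residue, in daltons.
AVERAGE_RESIDUE_MASS_DA = 110.0


def approx_mw_kda(n_residues: int) -> float:
    """Rough molecular weight in kDa estimated from chain length only."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA / 1000.0


# Chain lengths quoted in the text: mature protein and three immature forms.
for label, length in [("mature", 115), ("immature", 178),
                      ("immature", 167), ("immature", 136)]:
    print(f"{label}, {length} aa: about {approx_mw_kda(length):.1f} kDa")
```

The estimates fall close to the quoted values, with small differences expected because real proteins deviate from the average residue mass.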

Example 4: Preparation of Tango-77 Proteins 

Recombinant Tango-77 can be produced in a variety
of expression systems. For example, the mature Tango-77
peptide can be expressed as a recombinant
glutathione-S-transferase (GST) fusion protein in E. coli
and the fusion protein can be isolated and characterized.
Specifically, as described above, Tango-77 can be fused
to GST and this fusion protein can be expressed in E.
coli strain PEB199. Expression of the GST-Tango-77
fusion protein in PEB199 can be induced with IPTG. The
recombinant fusion protein can be purified from crude
bacterial lysates of the induced PEB199 strain by
affinity chromatography on glutathione beads.

Example 5: Alternatively spliced forms of IL-1ra and
Tango-77

The computer program Procrustes (Gelfand et al., 1996,
Proc. Natl. Acad. Sci. USA, 93:9061-9066) is an alignment
algorithm that predicts the presence of alternatively
spliced exons for a protein of interest in a stretch of
genomic DNA. Using the IL-1ra sequence, Procrustes was
used to search for the presence of additional sequences
that might encode alternatively spliced forms of IL-1ra
in the two overlapping BAC genomic sequences (see
Fig. 3 and Fig. 4). Potential sequences that encode
variant exons for IL-1ra were identified. These
predicted exons aligned well with the N-terminal region
of IL-1ra, but were not present in Tango-77. The results
from Procrustes predict the existence of more spliced
forms of IL-1ra.

Furthermore, Procrustes also predicted an
additional sequence in BAC1 and BAC2 that encodes an
alternatively spliced exon for Tango-77 (T77-procrustes;
Fig. 5). This predicted splice variant form of Tango-77,
T77-procrustes, was aligned with Tango-77 (Fig. 6) and
with IL-1ra and IL-1β (Fig. 7).

PCR primers within this sequence can be used to
generate a product that can be used to screen a panel of
cDNA libraries using standard techniques. Suitable cDNA
libraries include libraries made from TNFα-treated
bronchoepithelium, TNFα-treated SSC cell line and
anti-CD3-treated T cells. The resulting cDNA clone(s) can
be isolated from the library and sequenced to identify
additional Tango-77 cDNAs.

Equivalents

Those skilled in the art will recognize, or be
able to ascertain using no more than routine
experimentation, many equivalents to the specific
embodiments of the invention described herein. Such
equivalents are intended to be encompassed by the
following claims.

What is claimed is:

1. An isolated nucleic acid molecule selected
from the group consisting of:

a) a nucleic acid molecule comprising a
nucleotide sequence which is at least 45% identical to
the nucleotide sequence of SEQ ID NO:1, SEQ ID NO:3, SEQ
ID NO:6, SEQ ID NO:10, the cDNA insert of the plasmid
deposited with ATCC as Accession Number 98807, or a
complement thereof;

b) a nucleic acid molecule comprising a fragment
of at least 300 nucleotides of the nucleotide sequence of
SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ ID NO:10, the
cDNA insert of the plasmid deposited with ATCC as
Accession Number 98807, or a complement thereof;

c) a nucleic acid molecule which encodes a
polypeptide comprising the amino acid sequence of SEQ ID
NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8,
SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, or
an amino acid sequence encoded by the cDNA insert of the
plasmid deposited with ATCC as Accession Number 98807;

d) a nucleic acid molecule which encodes a
fragment of a polypeptide comprising the amino acid
sequence of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID
NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:11, SEQ ID
NO:12, SEQ ID NO:13, wherein the fragment comprises at
least 15 contiguous amino acids of SEQ ID NO:2, SEQ ID
NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9,
SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, or the
polypeptide encoded by the cDNA insert of the plasmid
deposited with ATCC as Accession Number 98807; and

e) a nucleic acid molecule which encodes a
naturally occurring allelic variant of a polypeptide
comprising the amino acid sequence of SEQ ID NO:2, SEQ ID
NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9,
SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, or an amino
acid sequence encoded by the cDNA insert of the plasmid
deposited with ATCC as Accession Number 98807, wherein
the nucleic acid molecule hybridizes to a nucleic acid
molecule comprising SEQ ID NO:1, SEQ ID NO:3, SEQ ID
NO:6, SEQ ID NO:10, or the complement thereof under
stringent conditions.

2. The isolated nucleic acid molecule of claim
1, which is selected from the group consisting of:

a) a nucleic acid comprising the nucleotide
sequence of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:6, or SEQ
ID NO:10 or the cDNA insert of the plasmid deposited with
ATCC as Accession Number 98807, or a complement thereof;
and

b) a nucleic acid molecule which encodes a
polypeptide comprising the amino acid sequence of SEQ ID
NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8,
SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, or
an amino acid sequence encoded by the cDNA insert of the
plasmid deposited with ATCC as Accession Number 98807.

3. The nucleic acid molecule of claim 1 further
comprising vector nucleic acid sequences.

4. The nucleic acid molecule of claim 1 further
comprising nucleic acid sequences encoding a heterologous
polypeptide.

5. A host cell containing the nucleic acid
molecule of claim 1.

6. The host cell of claim 5 which is a mammalian
host cell.

7. A non-human mammalian host cell containing
the nucleic acid molecule of claim 1.

8. An isolated polypeptide selected from the
group consisting of:

a) a fragment of a polypeptide comprising the
amino acid sequence of SEQ ID NO:2, SEQ ID NO:4, SEQ ID
NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID
NO:11, SEQ ID NO:12, SEQ ID NO:13, wherein the fragment
comprises at least 15 contiguous amino acids of SEQ ID
NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8,
SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:12, or SEQ ID NO:13;

b) a naturally occurring allelic variant of a
polypeptide comprising the amino acid sequence of SEQ ID
NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8,
SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, or
an amino acid sequence encoded by the cDNA insert of the
plasmid deposited with ATCC as Accession Number 98807,
wherein the polypeptide is encoded by a nucleic acid
molecule which hybridizes to a nucleic acid molecule
comprising SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:6, SEQ ID
NO:10 or the complement thereof under stringent
conditions;

c) a polypeptide which is encoded by a nucleic
acid molecule comprising a nucleotide sequence which is
at least 55% identical to a nucleic acid comprising the
nucleotide sequence of SEQ ID NO:1, SEQ ID NO:3, SEQ ID
NO:6, or SEQ ID NO:10.

9. The isolated polypeptide of claim 8
comprising the amino acid sequence of SEQ ID NO:2, SEQ ID
NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9,
SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, or an amino
acid sequence encoded by the cDNA insert of the plasmid
deposited with ATCC as Accession Number 98807.

10. The polypeptide of claim 8 further comprising
heterologous amino acid sequences.

11. An antibody which selectively binds to a 
polypeptide of claim 8. 

5 12. A method for producing a polypeptide selected 

from the group consisting of : 

a) a polypeptide comprising the amino acid
sequence of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID
NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:11, SEQ ID
NO:12, SEQ ID NO:13, or an amino acid sequence encoded by
the cDNA insert of the plasmid deposited with ATCC as
Accession Number 98807;

b) a fragment of a polypeptide comprising the
amino acid sequence of SEQ ID NO:2, SEQ ID NO:4, SEQ ID
NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID
NO:11, SEQ ID NO:12, SEQ ID NO:13, or an amino acid
sequence encoded by the cDNA insert of the plasmid
deposited with ATCC as Accession Number 98807, wherein
the fragment comprises at least 15 contiguous amino acids
of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7,
SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:12, SEQ
ID NO:13, or an amino acid sequence encoded by the cDNA
insert of the plasmid deposited with ATCC as Accession
Number 98807; and

c) a naturally occurring allelic variant of a
polypeptide comprising the amino acid sequence of SEQ ID
NO:2, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8,
SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, or
an amino acid sequence encoded by the cDNA insert of the
plasmid deposited with ATCC as Accession Number 98807,
wherein the polypeptide is encoded by a nucleic acid
molecule which hybridizes to a nucleic acid sequence of
SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:6, or SEQ ID NO:10
under stringent conditions;

comprising culturing the host cell of claim 5
under conditions in which the nucleic acid molecule is
expressed.



13. A method for detecting the presence of a 
polypeptide of claim 8 in a sample, comprising: 

a) contacting the sample with a compound which
selectively binds to a polypeptide of claim 8; and

b) determining whether the compound binds to the
polypeptide in the sample.



14. The method of claim 13, wherein the compound 
which binds to the polypeptide is an antibody. 

15. A kit comprising a compound which selectively
binds to a polypeptide of claim 8 and instructions for
use.



16. A method for detecting the presence of a
nucleic acid molecule of claim 1 in a sample, comprising
the steps of:

a) contacting the sample with a nucleic acid
probe or primer which selectively hybridizes to the
nucleic acid molecule; and

b) determining whether the nucleic acid probe or 
primer binds to a nucleic acid molecule in the sample. 

17. The method of claim 16, wherein the sample
comprises mRNA molecules and is contacted with a nucleic
acid probe.

18. A kit comprising a compound which selectively
hybridizes to a nucleic acid molecule of claim 1 and
instructions for use.

19. A method for identifying a compound which
binds to a polypeptide of claim 8 comprising the steps
of:

a) contacting a polypeptide, or a cell
expressing a polypeptide of claim 8 with a test compound;
and

b) determining whether the polypeptide binds to
the test compound.

20. The method of claim 19, wherein the binding 
of the test compound to the polypeptide is detected by a 
method selected from the group consisting of: 

a) detection of binding by direct detecting of
test compound/polypeptide binding;

b) detection of binding using a competition
binding assay; and

c) detection of binding using an assay for
Tango-77-mediated signal transduction.

21. A method for modulating the activity of a 
polypeptide of claim 8 comprising contacting a 
polypeptide or a cell expressing a polypeptide of claim 8 
with a compound which binds to the polypeptide in a 

sufficient concentration to modulate the activity of the
polypeptide.



22. A method for identifying a compound which
modulates the activity of a polypeptide of claim 8,
comprising:

a) contacting a polypeptide of claim 8 with a
test compound; and

b) determining the effect of the test compound
on the activity of the polypeptide to thereby identify a
compound which modulates the activity of the polypeptide.

>Contig1

3AAG7GAAGATATAATG7ATAGTAGTAATATATAATG7TAGGTGAATTAA 

AGGAAATAGAATATATTGGGGAGTAAT7ATGGGTG7AAAGAAATATAGTA 

GGGAAG7A777AGA777GAGAAAAAAAAAAAGGAA777AG7G7AGG7GAA 

JIAA7AAAAGNANAAGG77AAAAA77AAAAAAAAA77AAA7A7AAA7AAA7 

AAA7AAAAA7AAAAA7AAAA7AAAAAA777AAAAAA77AAAAAAA7A7AA 

AAAA7AAAGAAA7GGAAG7GGA77C77AGAAAAAAAAGAAAG7AAGG7GA 

TA7GAGGAGA7AGAGAGGA7G7GG7G7GAGA7GA77GG777AA77AGAAA 

A7AGG777TGAA7AGAG7GGGAAAG7AGAG7777GG7AAA7G7GGGGGGA 

AGAGGG7AA7G77G777GAG7GAAAGAAAAAA7GG7A7A77777A7AAAA 

TAA7GAGGAAAG7G7G7GAAAAAAAAA77A77GGGA777GGGAAGG7GA7 

A7A7AAAG77G7GGAAAA777GGGGGG7GGGG777A777AGGA77AAAAA 

G77A777AAAGAA7GAAAA7GAA77777G777G7AA777GGGGA7AAGAA 

A77AA7G777AGAAAGAAAGGGAAAAAA77GAAGAAAAAAA777AGA777 

7GGAAATT7AAAAA7A77G7GGG7G7AAA7AGGAAGGA77777AAAGG7A 

A77G7GGAAGGGA777G7G7GGAAAA7AA7AGGGAGAAAAAA7GGGG 

>Contig2

GCA7C7AAC7GGAGCC7GCA77A77ACAGA777AGCA7CACCAAAG7C7A 

AACAA77AGAC7GAC7AAGGCAGAAC7GCCC77A7GACAGCAGACA7AAG 

AAGGAAAAGGCCAAAACAC7G7G77AAAAA77A7CCAAA7G7GAGGAAAA 

GGCAAAGAGAGTAGGTGTSeCTTTTTAGTGTCTAAGCTGCCTGCCCAAGG 

GGCA7CTGA7GC7C7CAGGCAGGAG7CCACAAA777777777G7AAAAGA 

7CAGA7AG7AAA7C77T7CAGCG7GAAGAGCA7GAGG7C7CTG7CACAAA 

TAC7CAACCACCA77ACAACA7GAAAGCAGCCAACAGACAACACA7GACA 

AA7GAG7G7GGC7G7G77CCAG7AAA7C77GA77ACAAAAACAGGCAAGA 

GGCCAGAGC7GACCCA7GGGCCA7AG777GC7GACCCC77C7G7AAAGGA 

AAG7A77777G777GAC77GC7G777ACCA77GA77GAACACAAGGC7C7 

G7AAAG77AC77G77AAC77GCAGAAGA77GA7GAG7GGCAAG7AA7777 

7ATTCACCAGAA7A7AAAA77A777C7G77CAG7AGAAAAGA7AAACCAA 

CTGTGA7A77A7GG7CC7G 

>Contig3 

GGGG7G7C7G7C7ACCA7G7GC7CGCAG77C7G7AA7AAA7G77C7C7CA 

AGATCC77AAAA7C7C77GGAAA77A7AAAAA7A77GGAAAGAGAAGAAC 

AG77777AAAA7A7A7A7A7A7A7A7A7A77777777GAGA7GGAG7C77 

GC7C7G7CG7CCAGGC7GGAG7GCAGTGGCGCAAAC77GG77CACCACAA 

CC7C7GCC7CCCGGG77CAAGCGA77C77C7GCC7CAGCC7CC7GAG7AG 

C7GGGAC7ACAGGCGCCCGCCACCACGCCCAGC7AA77777G7A77777A 

G7AGAGACGAGG7777AC7A7G77GGC7AGGC7GG7C7CAAAC7CC7GAC 

Z77G7GA7C7GCCCGCC77GGCC7CCCAAAG7GC7GGGA77ACAGG7G7G 

AGCCAC7GCACC7GGCGAGTTTTTTAAATA7A77TT7AAAAACACTTGAA 

7AAGAG7CAG7G7AAAC7AGAAG777AAAAA7GC77CACAGAACACCCAG 

GG777ACA77ACAAGA77C7CACAACAAACC7A77G7AAAGG7GAG7AAG 

GCA7G77A77ACAGAGAAAAG777GGGAGCAAAAC7G7AAAAAA77A7A7 

7777G77G7A7777C7AAGAGAAAGAG7A77G77A7G77C7CC7AACC7C 

7G77GA77AC7AC777AAG7GA777CC7TGAGAGCACA7GA7GA7CC 

>Contig4

GCCG77CA7AGAAAAC7GAAAGCAA7AAGA7GAC7AGG7AAGCA7GACA7 
77AAAAGG7A77CA7GGGACG7GG77ACAAAACCAAC7CACAAC7AAAAA 
G7C77AGGACC7C7CGC7GAC77AGGAGCC7GA7CCCAAC7C7GAGAA7G 
AC7CAG7G7G77ACCC7G7GGC7AG7G7AGACCAA7GA7CC7G7C7CAGA 
G7CAC7AGCCAACAGCCCA7A7CAAG7AC77GAAAC777GAC7CAGAAAC 
C7CAG7GTCAGAACC777GACC7AGGAACCACC7G7AG7GG77AAC7GCA 
A777GCACCCC77AG77CAGGGC777ACAACACCGGGGGCGGGGAGGGGA 
AAGGCA7ANANC7GA7GACC7AAAGGAAACCCA77GCAGCAACGC7777G 
7G77AAG7G7ACAAA7AAG7G77G7777AGAA7CC7CCAGG7AA7GCC77 
7G77A7T7AA7G7G7C7GAGACAA77C7GCACA77AAAGAA7A7AAAA7A 
77ACC77G7AA77CCAA777GAAA7G7G7AA77GACA77AGAC77C7A77 
TGAA77TGAAA7G7C7AAAACAA7G7GG77AAG777G7AAAAGG7G7G7G 
AA7T77GAG7C7GA777AC7ACA777777777AA7777C777777777GG 
AG7777AGGGA77GC77AGA7GGC7AGAAAGA7777A77CA7CAGA7777 
7AAGTC7GCCTTGGCAGGCAC7TGCAG * oTTTGAAAGAATCAGATA^ATC
AAATTTGTAGTTTAAAATATTTAAGGGAACTCAATTAACTATGCTAGAAA 
AGAGAATTAAGTATTTAGGAGGATTTAATATGGTGTGAAAGTTGTGAAAA 
^CAAAATGGAGACACTAATGTTAAGAAAACCCTGATAAATGGAACCAGGG 
AAAGGCATGAAGATAGAGTTCTCACACTTGTATCCCTGATCATGAAAAAG 

ATCTGC 
>Contig5

GGGTTTTTCCGCGTTTTTACCCGAAATCTTCAAGGGATGGGAAAAAGAAA 
ATTGCTAAAAAATCTCGGTTTTTTGGTTTTAACAGATATTTACACCNTGG 
ATCCCATTTATTATGTTGTCCCCAAGGTTTTCGGTGGGTTCCCAATCAGT 

Vagcccccctccacagtgaaagcactttactttatcaccttcacctaaag 
cataaaatccagctcttgaaagctgctccttgttaactgaatatatccac 
atcccaaaagtaatgatccatgcttcataatctgccacggatggatggat 
ggatggatggatggatggatggatggatgaatggatggattgatttcttg 
gaggatttgttgaatttgggaaattccacgccaggacagctggcccaaac 
tgcccgcgacaatctgctcggtacaaggggagggtcctggagagggtgcg 
gcccgagccccagtttggaaatgccaacttggctctgcagccgggcctta 
gccacttgggtctggcgtccctccattattagcgccatgccggctcgggg 
tgctgccaagtccctgagagcacaagcc 

>Contig6

CGCGCTCAAGAAAAGCTGAAGTGTGAATGTTCTGTC7ACCTTCACAGTAA 
ATGC7AAGAGAATGACCCAAGAGCAGAGGGTATCACTCTGCTACGGAGGA 
TTGATTG7AACTGGCTCTCCTGCCTTAGCAAGAAATGCCAGAACCATGG7 
CA77CAAG77C77GACCAAAAAC7GCC77CA7GAGAA7CAAC77CCCCAA 
GAAAAAAAAAGCAGAAACAGGCAAAGC77CCAGCA7GG7AGG7AA7AC7G 
ACCC77C77CCC7CCTTCC777GGAGA77CACACAG7AA7AA7GCA7AAA 
GC777GCCAA7GGAC7AAGCAC7GCCCAGGGG77777G7CA7GCCTGGAC 
7GAAA7GC7C77777GCG77A7CA7AGAA7CCCAG7GCAG7C7GAG7AGA 
C7C7AAGCAAAAGGGACA77777CAAAAAGGC777AAA77GC7AGTACAA 
AGAAGGCAACAAAACTTGCG7AAC7GTGGACAGA77AAC7CAC77GG7G7 
77TGGC7C77CAG7777CCC77GGC7GCGAAG7AC7CC7GAAGC7T7C7C 
7GCGGC7C77CC7GCAAGCAGGCAAGCAAAAAAACGAC7GAAC77TA777 

CGAGA7 
>Contig7

GAAGAGCCGC7AAC77GC7G7AG7GA7AAGGAA7GAAC7AAGGC7AGGGA 

CA7A77AACA7CCGC7GG7GG7GAC7C777AGCC7AGA7C77ACCCCAC7 

CC7GC7CC77CCA7A7GG77CGG7C7CAGGC7CAC7ACCGA7CAATGGCG 

7AC7AAAAGCAC7AAC7A7AGAC7CCAACACG7C7G7CG7G7G777CACG 

ACAAGCCG7GGAG77AA7CCC7C7GACAG7AGC7CAGA7AAGGA7GGGC7 

A7CA7GGGCCCGGAAC7GGGGCA7GACGC7CG7CACCAACGCA7GAGC7C 

CCCAAG7A7GC7A7ACCTG7CCC7A7GAAGGGC77CCAAC7C7A7G7GCA 

G7CCCCA7G7GGAGAG7CAGG7AT7GATTGATCAAGCCAGGGG7GTGGTG 

AA7GGGGAGC77CCTACAGGGG7AA7GATAAT7GAAA7GCACGG7GATGG 

GGA7777CA7ATTGGTC7CC7AAGGAGA7AACAGA77GGA7GCGGGG7CG 

A7A77CCAC7GCCCAGGGTG7GTACCGAGGG7ATC7GCAGG7GGA7CTCC 

7CCCCACG777GA77AA7AC7CC7G7C7TGGGAAGCA7AGACGGGCGGGG 

GAAA7GA7GAAGGG7GACCAC7CCCC 

>Contig8 

GGGAACGCAGTGC7C7G7ACGA7GGCC77GA77GCGAA77CC7GCAGGGG 
GGG 

>Contig9

GGCAAGAGA777AA7A77CA77CCA7C77CA777GGAAGA7GAAAAA77G 
GGGACCAGAGAGGGGAGGGGAC7GGGCCAAG777TCAAAGAAAAG7CAG7 
AGGAA77G7GAAT7CCTGGGGGCCGGGGCCCA77AG7GC7G77T7GGATC 
AG7AAA7GGAGA7G7GAGTTTCAACAG7AACAGGGACA7777AAAA7TAA 
AA7GA777AACC777AGAAAA7G7CC7AT7TTG7AA7AA7GA7GGA7TCA 
CAGGAAGG7ACAAAGAAA7G7CCAGAGAG77CN7GAGCCCCC7TCAGCCA 
GC77C77CCAA7G77AACA7C77GCA77A77A7AG7ACAACA7CAAAAC7 
GGGAAA7CGA7A77GG7AC7G7CCAGA7AGC77AC7CAGA7777GCCAG7 
7A7AC77CCAC7CA777G7G7G7G7G7G7G7G7G7G7G7G7G7G7G7G7G 
TGTGTGTAGCTCTATGCrtATTTTATGl w * GTAGCTTCATGTAAGCSC C^sJ

AATCACAATACTTAACTATGCCCTCATCACAAGACTCTC7CTTGCTATGC 

TTTACAGCTGTATCCTCTTCATCTCCAAACCCTAAGCCCACCTCACCGCC 

TCCACCATCTCTAATCCCTGGCAACCACTATTCTGTGCTCCATCTCTGTA 

ATTAATTGTGTTAATTAATGTTATACAAATGGAATCATGAAG7ATGTGTC 

CTTTGAGATTGGGCTGTTAATTTTTCACTCAGCACAATTTCCGTGAGTCT 

AATCCAACTTGTGTGTAGCAGTAATTCTTTCCTTATTATTGCTGAATAAT 

ATGCCATGGTATGGATGTATCACAGTGTGTCTAATCCTTTGCCCATTGAA 

AGGAATTTGGATAATTTCCAGGTTTTGGCTATTATGAATAAAGTGAACAT 

AAGACATGTGTGTACAAATTTTGGTGTGATCAAAAGTCTCATTTCTCTGG 

GATAAATGCCCGGTAATGAAATGGCTGGGTTGTGTGGG 

>Contig10

GCAAGAACACAGGCGCGTATTATAACCTTACTACCAAGACCTGAACCCAT 

ATAAAGGTTTATGCGTAACAATCATCATCCCTGTTCCAGAAGATTACACG 

TACGACCACGCCTGGCTCACCGACTCACGTGGGCCAGTACCAGAAATTCT 

CCCAAACAAACAGTCGTGTCTGAAAACAATCGCGGTGACCXCCACGGTTA 

GAAAAGCC7GTTTTCAAGTCCTGGAATTGCCACATATTAGCTGGGTAACT 

TTGGGCATCACATTTACTCTCTCCGAATTTCAGATTGCAAAAACTCATTG 

GATTGTTTTGTGGATTGAAAGAAATAATGTAAATTTAGGCCGAGTGCTTT 

GACTTACGCCTGTAATCCTATCACTTTGGGAGGCCAAAGCAGGAGGGTCA 

CTTGAGCTCAGGAATTTGAGACCACCTCTGGCAACATAGTGAGATCCTGT 

CTCTACAAAAAATTTTTTTTAAATTATCCAGCATGGTGGTACACGCCTGT 

A77CCCAGC7AC7CAGGAGAC7GAGG7G7GAGGA77GC7AGAACC7GGGA 

GA7CAAG7CAACAG7GAGCCG7GG77G7GCCAC7GCCC7CCAACC7CAG7 

GACAGAGGAAGACCC7G7C7CAAAAAAAAAAAAAAAAG7AG7AAG777AA 

AGAAC77AG7G7AGGCC7GGCA7A7AAA7GA7A77G77GA7G77GA7G77 

AGC77GAAGGCACA777A7AGGAG7AGGGA7777A7AACA77A7GAGCC7 

GAGAGCACA7A7AA7G77CCC 

>Contig11

GGTC7AACA7GC7CCAAC7GAAGAAACCCCACAC77G7CCGGCAAGGAAA 

CTAC7ACAGA777CC7GACC7AC7G7GCAA77CGGGGCA7GCGACGGGAC 

TG7G777C7GGG7ACGC7G7C7CAGG77CG7C7GGGA7G7AAGAA77CAA 

C77CAG7AG77C7C7CA7AGACGCCGACGAGAGGGGCG7C7C7777C7C7 

GA7GAA7C7GCCAGA7C77CCAC77CA7AGAG7C7AAA7CC7CCGA77CG 

A7C7AC7GGAGACCCCCACG7TACAAAAACG7C7AACG7CGG7GACAGC7 

CCCCACA7AGGGAAAGA7CACC7GAG7C7CAC7ACC7CACA77AG7GC7A 

TC7CCAGCCCCA7GC7A7C7ACGAGA7GG7CACGCGAGG777AAGGGG7C 

7CCGA77CCGG7GG7CCGA77CAGC7AA7CG7GGCCC7ACG7GAACGA7C 

AC7CC7GC7CG7AACA7CGA7ACAGGG7CGCGC7GACAAA7GG7AC7ACG 

7AGG77C7CAGG7CAA7GCCGCG7CACGAA7GAGCC7AAC7ACCCCA7AA 

G7GCACG7AC7G7G77ACC777CC7G77CGGCCAAACC7GC7AC7G7A7G 

C7G7GC77G777 

>Contig12

AGGC7CCA7G7GC7C7AGCC7GA77A7C7777CAAG7G7777A777GC7A 
A7C7A7AAGGCCCC777CG7AAAA7G77CAC7CA7777C7AA77AGA7A7 
7777777AA7G77GAG7777GAGAG77C777AGA7A7777AGA7ACAAG7 
CCA77G7CAAA7A7G7GA777ACAAA7A7777C7C7CAA7C7G7AA777A 
G7777CA7CC7C77AACAGGG7C7777GGAGAGCAAA7AA777GA7777C 
A7AAGG77CAAA77A77AA777777C77G7A7AG77CACAC77C7AG7G7 
7AAG7C7AAAAAC7G7GCC77G7CA7AGG7ACCAAAGG7777C7CCAG77 
777777C7AGAAG777AGAG777CA7G7777ACA77GGAG7CCA7GA7CC 
A77G77AA77AA77777G7A7A7AGG7AGA7G777AGG777AGGG77777 
77AAAAAAAAA77ACA7A7G777AA77GC7CCAG77CCC77TCA77GAAA 
AGGG7A7CC77CC7CCA77GAAT7GCC777G7CAGAAA77AA77GGACA7 
A777G7G7GAG7C7AT77C7GGGC7C777A7CA7G77ACT7T7AAAAAA7 
GCA7CAG77CC7CCACCAA7ACC7CA77G7C77GA77A77GCAGTTA7A7 
AG7AAGCC77AGCA77AGGAAAAG7G77777CCTGC777A77C777N7CA 
AAAAA77777GGA7A77C7AGGGCC777ACA7A7AAA7777AAAA7AAC7 
T7G7C7A7G7C7AACCGAAAGCC77A7GAAGA7777GA7AAGAA77GCA7 
TA7GCC7A7ACA77AA777AAAAAGAAC7GA7G7C777A77CAG77GA77 
CTGCTAATCTATGAACAl^GCATCTCTw . CAAAGCATTTAGTCTftSft *
AATTTCTG7CATTAATTTTTTAAAATTTTCATCCTAAAGATTCTGTATAT 
GTTTTGTTGAATTTATGCTTAAGCATTTCACTTTCTTGGTAACAATTATA 
AATGATTTTGTGTTTTTTATTCCACTAGTTCATTTTCAGTGTGTAGAAAA 
GCAATGAATTTTTGTGTGTTGATCTTTGTTCCTACATCTTGCAACATTAT 
TGAACTCATTTATTAGTTCTAGGAGGTTTTTTCATTTTTCTTGTAGATAC 
CTTGAGATTTTCTATATAGACAGTCATGTTGTCTGCAAACAGGCACAGTT 
TT^TTTCTTCCTTTTCAATCTATATGCCTTTTTTTTTTTTTTTGCCTTAT 
TGCAGTGGCTAGAACTTCTAGCACTATGTCAAATAGCATTGGTGAAAGCA 
GACA7CC77G77CC77G7C77AGAGGAACA777GG7C7T7AA7C77GGA7 
TGCG 

>Contig13

gcgcctccttttctcttccaaaatttctcttgtctagttatttgtccagg 

gaaatttgaaagctcacttactgtgcaagtcagcaggaaacaactgggtc 

tgtgcacagcacctagcaaagttctgctctaggaattacactttggccct 

gaggtagatttctacaagaaccttaccttctaagcagcactggggttcat 

ctttttcccagtcctcagagcccattttcactcctgagttctcccccaca 

aaggacattttcaacgttgagtttattactcaacagaaaatggaatgaag 

tccaagacctaaggagatagaaaggggaccagttatggcatcttctcacc 

ccaggacaccttgctgcatgtctctagtgctgaacagaccactggccttg 

<:tgtgtagtttgaaatgctcgctgcaaccagaaaggcaccaaggggccag 

accatgc7ctcctgtctatcacgccttcaaagcagaatttcccaaacctt 

gagtcacagtgctaacacacggggtgccataacatttttgttgattttgg 

cattttacaaaaataaaataaaaaagttaaaaatgcattgctctattctt 

ggggctggcacactattgcctttggccaaatccggtccctgactgttttt 

ttaaataaagttttattgaaacacaaccatgctcttgtgtacatattgtc 

tcttggctgcttcgaagctacaata 

>Contig14

GTGTTCGCTTTTTAACACTTACCTAAAATTACTCTGTAATCCATGGATCC 

TTAATTTATTTAAAAAACTAATGTTAATGAGTAGCTTTATTTTCCTCCCA 

TCTAATTTAAGGCCCACAGAACACCTTCACTTACCTCAATCCTCTCCCAA 

CTTACATGCTTTTAATGTCATATATGTTAATACCGTATACTTTTAAAACT 

TTCTAAAATAGCATTATTTTATAGCATGAGTGTTCATTTACATTTTTGCA 

TATAT7TAGAATTTTCTTTGCTCTTCGTTTCTTCTTCTATTTATGACTCC 

CCTCTGGGATCATTTTCCTTCTACTTGAAGTACATAGTTTAGAACTGCAC 

TATTCAATACAGTAGCCACTAGCCATGTGTAGCTATTGAAG7TTAAACTA 

AGTAAAAT7GAGTAATATTAAAAACTCAGTTCCTTCATCTCACTAGCCAC 

A777CAAG7GC7CAGCAGCCACG7GCGAC7AA7GAC7AC7G7ACA7CAAA 

CA7A7AGAACA777CCA7CA7GGCAAAGAGC7C7A77GA7AG7G77CA7C 

CAGAG777C7G77CCAGGACCAAAC7GAGGG77GGGC7GC7A777C7CA7 

GGCCCAA7AACAAGA7GCAGA7GAGC7GGGGAGGAAGAGAG77777A7T7 

C7G CNAC CA777 AC CGGGAGAAGGCC7GGAAA7CA7CAC CAGGC CAAC7C 

AAAA77A77ACG77TTCCAGAGC77A7A7ACC7TC7AAGC7A7A7G7C7A 

CG7G7AAG7G7GCA7TCACC7GAAGACG77AG7GA77AAC77C7777AA7 

C7G7AAC7AAGG7C7GAG7CCGGAAGA7C7TCCCC7GGAGCC7CAG7AAA 

777AC77AA7C7AAA7GGG7CCAGG7GC7GGGG7AA77ACCC77A7C77G 

7CCCC7GC7AAA7CATGGAGG777GGGGATTCC777AGAGCACCAA7AAA 

C77G777G7GGAGGCC7GGGGG777C77C7GACCCACAA7AAAAC77G77 

7AA7CC7AAA7GGG7CC7G77AAGAA77CC77C777A77TTG7CA7A7T7 

TAAGGCCCAGAAAAGGCC7GGGCAAAAC7C77GA7GGGC7777G77ACA7 

7CCAGCC777G7A7AAGAACAC7GG77777AA7A777AAC77AACCA777 

AG7CAG7AC7GAAACAG77GTTA7AGAGA7C7GCA77AG7GAGACC7GGC 

C7GCCACA777CC777TC7GAAGA7C77A7GG7AG7GA7CACCT77G7GA 

AAGGAAAA7AAA7CTTGGGACC7CAAAA7CAC7AAGCCAAAGAAAAAAG7 

CAAGC7GGGAAGAATC7GACAC77AAA7CCAACAC7GC7AAC7CA77CA7 

C7CAC7CA77CA77CA7777A7777C777777C777C77777777777T7 

777777GAAACGAAG7C77GC7C7G7CACCCAAGC7GGAG7GCAG7GGA7 

C7CAGG7CAC7GCAACC7CCACC7CCCGGG77CAAGCGA77C7CC7ACC7 

CAGAC7CC73AG7AGC7GGAA77ACAGGCACC7GCCACCACGCC7GGC7A 

A77777A7A77777AG7AGAGACGGGG777CACCA7G77CA7CAGGC7GG 
TCTCGAACTCCTGACCTCGTGATCCGC -CCCCCTCGGCCTTGTTTGC1 i




CACACAAGAANATTGTACAAAACCCCTAGGA 

GGAAAAACCTATCACCGCCTCCTATGGAACTTAAAACAAAAAGAAAAGTA 
ACAAAGGAAATGAATATTTCATTCTGGAAGAACATTGAAAAAGAACAGGA 
AGAAGAGAAAGCACAACTCGAACTGTCCACTAGAATTGACAACACTCTGA 
* wr^-rna a rr-rriTrf:! a ryirtfiT&RGTG AAAAAAATAAGCTCCTC 



CAGCTTTGGCCCAAAGTCTTATAAT'I "1 ~l TflAALA 1 A i 1 x««n.xrv * * 
ATAGGAGAGATAGCCTTCATCTAAGTAGAAATTTAGCTACTCTTGTAAAT 
ACAGAGTAATAATAATAATGACATGCCCATAAACAGTGTCTTTTGTGTAT 
CTGTGCTTTTATAAGCACTTAGCTAAGATTATCTCACATAATTATCATAA 
''CACTGTTACTATGACCACTTTACAAACAAAACTGAGGCACAAAGAAGTT 
GGAAAACTAATCCAAACAAACTGGCTCCAAAAGGAACTTTGCTTTCTTTG 



GGAAAACTAATCCAAACAAACTGGCTtUAAAA^AAV-i i iuwi 
GGTATCAAGTTCTGAAGAGTACACATTTAACATTGAAACTGAGGTCAGAA 
GGCAAGTTTCTATGTAAAGTTGGAGTATTCTGAATACTCTGGGTAGCTAC 
AAATAGTATTTAAATTTTATCTTGGATTCTGCAGATAAGGATAAAATAGA 
TGGTAGGCAAAGAGTATGATCCTTAGGAGAAATTTTTCCTGAAGGAAAAA 
TATATTAATAAAAAATGATGGAATAAACTTCTAAGATCCTTGCCTAGAGC 
AAAACTCATTCAGTCCTTTGGCTGGTAATGTTGAACATCAACAAAAAAAA 
GGAAAAGTTCAGTTTAAGTCTACTCCAGGCAACATTTTCACAACATCCAG 
^•AAATATTAACTATTTCTCTTTGTGGAATTGAACTAGAGTTCTTTTTCT 
TATCCTCTTTTTTGGTTGTTGTATTATTTAAAAATGAGTACCTTTTTATT 
ATTGAAATCATTTCAAGTAATGCAGATAAATGATCAGCCCTCTCCCTGTA 
CAAACATACATACTTAGGCATCCCAAACTTCTCTCTGGAGGTGACCACCA 
TTGCCAGTCATTCATTCTGTTTTCATGCATGTCCATACAGTATAGGTATG 
TCGAGAAATGAAGTATTATATTTTTGTGAGTTGCAATTCTTTTATTCACA 
TTTTTGTGTACTTTGGTTGTCTTTTCTTGTGTTTTCCTAGTACCAATGTT 
ATGCTGACTTAGGCAGATGAGTTGAGTATTTTCCTTTTTGCCCTATAAAC 
TGAAAATAGTTTGTATGACATGAGAATTATTTTTATTTTTTGAAGGTTTG 
ATAAAAACTTGCCCATAAAAATCGTCTGGACCGGTTTCTTGAGGATGCCT 

GTGTTAGAGCC 

>Contig16

CGCTTTAACCTGGGCTACCAATGGTTCGTCAAGTTCTAGATTCTCTATTA 

ATACCTTTTTCTTGTGTCTTTCTCTGGTCTGTTTTCAGCCCCGAGTCTCT 

TAGATCTGTCCTCTAATATTCCTATTGACTTTACTTCATTTTCTAAGTCT 

-t-ATCCTTTTGCTTTACTTTCCGAGAGACCTGCTTAACCTTATCTCCCAA 

C^CTTTTATTGAATTTCATTTCTTTTACTATATATTTTTTACTTTGAATA 

CACCTCTCTCTTCCTCACATTTTCCCCCATAGTATTTTGTCTTCAATTGA 

CAGTTCTACTATCTTATTACTCTGGAGATATTAATAATAGTTTTTAAATT 

TTTATTTATTTTTATTTTCAAAACAGTGTCTTACTCTGTCACTCACGCTG 

GAGTGCAGTGGTGTGATCATGGATCACTGCAGCCTTGATCTCTGAGCTCA 

AGCTATCCTCCTGCTTCAGCCTCCCAAGTAGCTGGAACCACAGGCATGTG 

TCACCATACCCAGCTAATTTTTTTGTTTTXGAGGTGGAGTCTCACTCTGT 

AGCCCGGTCTGGAGTGCAGTGGTGCAATCTGGGCTCACAGCAACCTCTGC 

CTCCTGGGTCCTGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGAGTAGC 

TGGGATTACAGAAACACACTACCATGCCCAGCTAATTTTTGTATTTTTGT 

AGAGACAGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCT 

TGTGATCTGCCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAG 

CCACTGCACCCGGCCACTAATTTTTAAATTGTTAATAAAGACGAGGTCTT 




ACATTTTTTAAAAGTTTTCTTCTCTTTACCAAGTCTTTTTTCCCCTTTCT 
GCTTTTTTGGGTTGTTTTATTTTGATCTCTATCTTGCTAGAAACTTTCTG 
CAGACGTTTAGTAATACTAGATTTTTGAGAGTGGGCAACTGGAAAGCTGA 
^TGGAAACTCTGAATACATGGGTGAGGCTTGTTGGCTGTGAGTGTCATTG 
C7TGATGTCCTGGCAAGGCCAATGGGTTTGGGACCCCTACTATTAGTATA 
GGCCTGATTCCCTGGGAAAGGCTCTTTTGATCTCCTGCCTGGAGGATAAA 
GGCCTGGCTACCAGCCTTCTGTGTGTAATGTGAGGGAGAAGGGCTGGAGT 
ATTCAACATCATGCTGAA* CCTTTCAA * jATCATCTTGTTTTTAGTAAf U

TCCTACCTTAACTCTCTGTCTTCTGCTAGTATGGGAAAGATGACCTGAAA 

ATCTAACCATTTATTTTTCCCCCATTAATATCATTTTATGATTATTCAGA 

AGTTAAATAATTGTCATGCTGTCCTCCAAAAAGACTGAATCAACTAGCAA 

CAAATAAGAATTTTCTCACAGCTCTGCCAGCATTTTAAAAGAATAGCTTT 

ATTGAGCCCAGGAGGTCAAGGCTGCAGTGAGCTGTGATTACACCACTCTA 

CCCCAGCCTGGGTGACAGAGCAAAACCCTGTCTCAAAAAAGAAATTTAAG 

GAACAGCTTTATTGTTGTAAAATAGACATACAATAAACAGAGCACATATT 

TAAATTGTGCAACTTATACTTTGATATAACCCTGTGAAAACATCACCACA 

ATCAAGATAGTGAATATATTTATCACCTCCTGATACAGTTTAGCTCTGTG 

TCCCCACCTAAGTCTCATGTTGAATTGTAATCCCCAATGCTGGGGGAGGG 

GCTTTG7GGGAGGTGATTGAATTGTGGGGGTGCACTTCCCCCTTGCTGTT 

CTTGAGATAGTGAATGAGCTCTCATGAGCTCCCCTTCACTCACTCTCTTT 

CCTGCTGCCATGTGAGGATGTGCTTGCCTCTTCTTTGCCCTTCTGCCATG 

ATGTGTTTCCTGAGTCCTCCCTAACCATGCCTCCTGTACAGCTTGCAGAA 

CTGTGAGTCAGTTAAATCTCTTTTCTTCATAAATTACCCAGXCTCAGG7G 

GCTCTTTATAGCAGTGTGAAAAGGAACTAATATACCTCCTAAGTTACCTC 

AAGCTTGTTTTTAATTCCTTCTCCTCCCTTCCTTCATTGCCAAGCAAACA 

ACCACCTGTTTTCTGTCACTATAGATTAG7TTACATTTTGTGGGTTTTTT 

TTTTTTTTGAGACAAGGTCTGACTCTGTTGCACAGGAGCAGAGCAGCGTA 

TC 

>Contig17

CGCG77A7AGGAGA7GCGAAC77AAGAAA7GA7GA7AAGGAGAC777A77 

AAA7A7AA7777GAA77A7777GCCA77ACAGAAA77C7AA77A777AAA 

A77C7A77CA7AA77777AA7CAC7G7AC77CCCAAGC77AGC77AGAA7 

CC77C7G7GC7GAGGA77AA7777AA777G7C7777A7AGGCC77A7C7A 

AAA7CCAAGAA7AA77GCCAGAA7CAACCACC77C7AAA7C7G7AAG7AG 

AAAT7AG7C7777TAAAAA7A7GCA77CA7AAG7A7GA77AG7AA7AAAA 

A7AA7AAAGA7G77AGCAACC7AAAGAACA7G7A777GAAAGG7A777C7 

7ACAGA7A7AAAAACAG777GG777AA7AAGAGACAA7CA777777GAAA 

AG7A7GACA77T77TGAAAAG7AG777AG7777A77AACCAAGAAAAGCC 

7CAAG7GAAC777AG7CC7C77GA7AGC7AACA777A77GAA7GC77AC7 

G7GTGCC7GA7AC7777C7GAC77GCA77ACC7CAC7GAG7CC7CACAA7 

C77A7GAGGC7AC7A77AG7AGCCCCAC777ACAGA7GAGCAAAC7AAG7 

CACAGAAAGG77AAA7AGG7CG7A7AGC7A77AAG7GACAAAGC7GAGAG 

CC7G7GA7C77AACCAC777GG7A7GC7GCCA7GAAG77AAA7AGC7CAG 

7AG7CA77AAAAGAGAACA777GCA77GAACC77CCAAGCCAC77AACAA 

G7A7A7GC77CC7AA7CAA777AA777AGC7ACA77AGA7AGAA7GG7AA 

AGGA7CC77AAC77AAAG777AAA7GGAAGAAA77AGCCC7C7GAAAGAG 

GCACAGA77A77CA7C7GCAA7AAAAA7C7CACC777AG777777AAAAC 

A7AG77777A7C7G7G77C7GAAA7G7AAC7AAAACAG7GC77CC7GAAG 

7GAAAAA77C7CAC7GG7GAGAA7777AA7AAG7777AA7GA77CACCAA 

A7CAC77CAG7CATA777CAG7CA7A7GCA7A7GCA7ATA7AGACA7A7A 

AG77777A7C7GTG77C7GAAA7G7AAC7AAAA7AG7GC77CC7GAAG7G 

AAAAA77C7CAC7GG7GAGAA7777AA7AAGT777AA7GA77CACCAAA7 

CAC77CAG7CA7A777CAG7CA7A7GCA7A7GCA7A7G7AGACA7A7ATA 

7G77G7A7G7A7ACA7GACA7CA77AGACAC7GTGAAGGA7AGCAAAATG 

7A7A7AAGGCAAAA777A7GAACAA7GG777AACG777GGGAAGCAC7GG 

G77ACAC7777ACT77A7GCAGA77GAACCAG7A7AG7A7GCAAG7C7TA 

AGGAAAAA7C7AC7GGAAAGGGCCC7CA77CAGAC77CCCAGAGGC77C7 

C7GGAAG77GACAA7AC7GAC77CAG7ACA7CAGC7CG7AAA7GAGGA7G 

A7ACC7ACC77ATC7GC7T7ACACAGTTG7AAAAG7AAAAAG7GAAC7CA 

GGAAGGGAA77ACAGAA777AGGAGAAAC7AAAAGCACGA7G7AAA7AA7 

AG7CA7CA77ACAG77A7ATAA7GC77GACAA777A7A7AACAC7TTCGA 

7ACA7GACAACAATAAC7AACACCCAGACA7G777A7A7ACAT7ACC7CA 

C7CAGAACAACCA7G7GAGGAAG77GGCCA7A7GC777AA7G7CCAAACC 

AGGACAC7777GAGAG7AAAAAGCAG7AC7C777GACCAACAGGCA7AAA 

7CAAAAC7A7C77G7GAAAACCGGGA7A7A7GGCA7CC77CC7AGA7AA7 

AGA7AC7777AC7A77A77AA7777GC7G7GAA7C7AAACC7GC7C7AAA 

AAAG77AA7777AAAAAG7AA7GAAG7AC7GA7ACA7GC7ACAACA7GGG 
TAAAT CTTGAAAACGTT^ f GCTAAGTG. _ AGAAGC CZAGACAGAAAKGGl *

ACATATTACATGATTCCATTTATATGACACATCTAAAATAGGCACATCTA 

TAGACATACAGAGACAGAAAGTAGACTAGCGGTTGCCAAGAACTGCAGGG 

AGCAGAAGATGGGGAGTGACTGCCAATANGAAAACGCATTACGT 

>Contig18

TGAATCGCAATGATATGTGCCACTTTGCACTCTCTGTGACATATATAATT 
ATTTTTAATGCATTCATTTTTTTCTCAGAGTGCATTCGTTTGAAAACATA 
GACGGGAAATACTGGTAGTCTTCCTTGTCAGTTAGAAACACCCAAACAAT 
GAAAAATGAAAAAGTTGCACAAATAGTCTCTAAAAACAATGAAACTATTG 
CCTGAGGAATTGAAGTTTAAAAAGAAGCACATAAGCAACAACAAGGATAA 
TCCTAGAAAACCAGTTCTGCTGACTGGGTGATTTCACTTCTCTTTGCTTC 
CTCATCTGGATTGGCATATTCCTAATATCCCCTCCAGAACTATTTTCCCT 
GTTTGTACTAAACTGTGTATATCATCTGTGTTTGTACATAGACATTAATC 
TGCACTTGTGATCATGGTTTTAGAAATCATCAAGCCTAGGTCAGCACCTT 
TTAGCTTCCTGAGCAATGTGAAATACAACTTTATGAGGATCATCAAATAC 
GAATTCATCCTGAATGACGCCCTCAATCAAAGTATAATTCG^GCCAATGA 
TCAGTACCTCACGGCTGCTGCATTACATAATCTGGATGAAGCAGGTACAT 
TAAAATGGCACCAGACATTTCTGTCATCCTCCCCTCCTTTCATTTACTTA 
TTTATTTATTTCAATCTTTCTGCTTGCAAAAAACATACCTCTTCAGAGTT 
CTGGGTTGCACAATTCTTCCAGAATAGCTTGAAACACAGCACCCCCATAA 
• — - AAATCCCAAGCCAGGGCAGAAGGTTCAACTAAATCTGGAAGTTCC^CAAG 
AGAGAAGTTTCCTATCTTTGAGAGTAAAGGGTTGTGCACAAAGCTAGCTG 
ATG7ACTACCTCTTTGGTTCTTTCAGACATTCTTACCCTCAATTTTAAAA 
CTGAGGAAACTGTCAGACATATTAAATGATTTACTCAGATTTACCCAGAA 
GCCAATGAAGAACAATCACTCTCCTTTAAAAAGTCTGTTGATCAAACTCA 
CAAGTAACACCAAACCAGGAAGATCTTTATTATCTCTGATAACATATTTG 
TGAGGCAAAACCTCCAATAAGCTACAAATATGGCTTAAAGGATGAAGTTT 
AGTGTCCAAAAACTTTTATCACACACATCCAATTTTCATGGCGGACATGT 
TTTAGTTTCAACAGTATACATATTTTCAAAGGTCCAGAGAGGCAATTTTG 
CAATAAACAAGCAAGACTTTTTCTGATTGGATGCACTTCAGCTAACATGC 
TTTCAACTCTACATTTACAAATTATTTTGTGTTCTATTTTTCTACTTAAT 
ATTATTTCTGCAATTTTCCCAATATTGACATCGTGTATGTATTTGCCATT 
TTTAATATCACTAGACAATTCAATCAGGTTGCTACGTTGGTCCCTTGGGT 
TTACTCTAAATAGCTTGATTGCAAATATCTTTGTATATATTATTGTTTTT 
TCTCCTATCTTGTAATTTCTTTGAGCACATCCCAAAGAGGAATGCCTAGA 
TCAATGGGCACAAATAATTTGACAGCTCTTATTAAACATTATTCTGTAAG 
TAAAAACTGAACTACTTTTCAGTATCACTAGCAACATATGAGTGTATCAG 
CTTCCTAAACCCCTCCATGTTAGGTCATTATGAACTTATGATCTAACAAA 
TTACAGGGTCTTATCCCACTAATGAAATTATAAGAGATTCAACACTTATT 
CAGCCCCGAAGGATTCATTCAACGTAGAAAATTCTAAGAACATTAACCAA 
GTATTTACCTGCCTAGTGAGTGTGGAAGACATTGTGAAGGACACAAAGAT 
GTATAGAATTCCATTCCTGACTTCCAGGTATTTACACCATAGGTGGGGAC 
CTAACTACACACAC^CACACACACACACAC^CACACAC^CACACACAC^C 
CATGCACACACAATCTACATCAACACTTGATTTTATACAAATACAATGAA 
TTTACTTTCTTTTTGGTTCTTCTCTTCACCAGTGAAATTTGACATGGGTG 
CTTATAAGTCATCAAAGGATGATGCTAAAATTACCGTGATTCTAAGAATC 
TCAAAAACTCAATTGTTTGTGACTGCGCAAGAAGAAAACCACCCATGCTG 
CTGAAAGTCAGTTGTCCTTTGTCTCCAACTTTACTTCCTTTACCTCTCAT 
ATGTTTGTGAATAAGCCCAATAAGCAGACNCCTCCTACAAAGTGAACCTG 
GTCTCTTTCCTCCTAACAGGG 
>Contig19

GTCTTGTAACACAGGTAAGACGAGTTCAAGTTTTATTTCTTGNTTTTAGA 
ACGGTAGTGAGCGGTTTTCAGCNTGAGACCACACCTAAGGTAAGTAGCTG 
AATTGGGGTTTTGTCTTGGCTAAAGTTTAACAACCAGCTGGTCTTAATTT 
CTCCTTACCATTAGAGCACTCAGTAATCATATAAGTTGTGTGATCATTCA 
TTTTGCTTAACTGTTTGTTTCTGTTTTTATTGCTGTTTCAGTCTTTTTCC 
CATTGGGTTTGACCTACTCTATCTGACTTGATCAAATCCAAAGGAAATTT 
CCAAATTATGGGGAATGAGGCCTCTGAAGTGGCTAAATTCCCACCCTCCC 
ACACACACAAACGTGGTATGGTGGGGGAAAAAACGGCCAGCAAAAGAAAA 
AAAAAAAGGAAAAGATGTTTCATTTTGACCACCAAACGGGCTTTATTTAC 
ATAACAAGGCCACCTTI ^'GCTAGCCA -rCCATACTGAAAGAGGRSXQL ...
TGTTGCCCCATGCTGTGGGTTCCATAGCTAACG7TCTGCCTTTTTTCCTA 
CCACGACAGCCTGGGTTTGGTTCCTAAATCAAGCCTTTTCTGGTTTGATA 
C77GG7AA7GC7GAAA7AGCAGCAA777G7CC7AGC7GAAA7A7CG7AA7 
AAGATTTTAAAAGATTTATTTTAAAGGACCTCAATAGTTAAAAGTCAGCT 
TAATTAAAAGCTAACATCCAAGATGTGTGCATGTGTATGTATGCGTCTTT 
GTATTTAAATAGCCCTCATGTTTTTTTTTTCTTTCCTAGGAACTTGCCTT 
TT^TTGAGCAAAAGTTTTTTTCTTCTCTGTTGACTGGATTCTGTTTTCTT 
CATTTACTTCTGCTGTCTCTCCTTTCTCTTGCACCGTCTGCTGCATGAGA 
GCCCTAAAATAGTTTATAATAGCCTGGGGTTCCTTAAAGAAAATGGAGAA 
GG7GCCAGGCTCCCTTTTAGGGAGAAACTTCTATTTTTCCTTATGGAATC 
CCTAGAGTG7AAACAGACAAGTTCATTTCAGCTCTTAAACTGCTTGCGTT 
TGTGTTG7GTTACCTGATTTTTTTGACTATTATATTTTTGACTAGCTATT 
GCAACAGAAGC7AC7C77GGGTT77CAAGGAAGA77GTAG777AGACA7G 
7AGAAA7G7C7T77AAAAAAAAAACAAAC77T77T77AAGTGCAC7G7AA 
AAGCA7CA7A7GG7C7AGCC7CC7AA7AA7777CCC7TT77GGAGACCAG 
GA77CAGGG7GGGC7C7GCCCAGAGC7CAGAGA7CCAG77AAAAGAGAGG 
7AG7C7CGGCCGGGCGTAGAGGCCCAGCC7G7AA7CCCAGCAC7TTGGGA 
GGCCGAGGCGGGCGGA7CACGAGG7CAGGAGA7CGAGACCA7CC7GGCCA 
ACA7GG7GAAACCCCG7CTC7AC7AAAAA7ACAAAAA77AGC7GGGTG7G 
- - G7GGCAGG7GCC7G7AG7CCCAGCCAC7CGGGAGAC7GAGGAAAGAGGAG* 
AA7CG777GAACCCGGGAGGCGGAGC77GCAG7GAGACGAGA7GGCGCCA 
C7GCACTCCAGCC7GGCGACAG7GAGAC7CCG7C7CAAAAAAAAAAAGA7 
AGG7AGAC7CGA7G7TG7CG7ACCCGAGCAAG77AGAGCAACGCCACAC7 
77GAGACGAA777AAGAG7CC777A7CAGCCGGCGACCAAGAGACGGC7A 
ACGC7CGAAA77C7CTCGGCCCC77GGAAGGGGC77GA7777CC777A7G 
C777GG777AGGAAGGGGAGGGGAGC7CAG77GCAACAA7TC7ACAGGAG 
7AAAAACA7GCAAAGAAA7TAAAAAGACAAG7GG77ACAGGGAAACAAAC 
AG77CCAGG7GCAGGGGC7C7AAA7C7A7CA7AAGA7GT7AGG7A7GGGG 
GCTC7GCCGGACACAAAC7CAAGGC777A7GC7G77ATC7C7TGAGCGAA 
A7CC7GGGAAC77CG7ACA77GC77GC77CAG7ACC77A7CAG77AA7CG 
GAC7C777GA7ATG77GGGAG7CAGCG7ACACAAG77AAC7CC77GAGGA 
AGGGGG7GGG7AAGGAG7CC77GA7G7C7GGTAAA7GAAGGAGCGAAA7C 
GAG77CC7C7GGC777C7CAGC7AAGGGAGAGC77A77CA7G7GGAAACA 
AGGC7AAG7GA77AAGGGAGAAAGGGAGAG7C7GAAAACAAGG77AGG7A 
T7ACAA7G7CAA7AAAA77GG7C7CC77A7ACAG7CC7A7GG7AGA777C 
T77CCA7C777AA7C7CCCTC7AGCACCACCAGAC77777C7C7C7G7AC 
C77GAGA7G7AAA7777GC7A7C7GAA7777CG7C7AAGAG77G777CC7 
77AA7A7GCAAA777AGGGT7A777AGC7GACAAC7GCCAAAG7AG7GAA 
ACAAG77A7CAAGAAC77GAACG7C7AAGG7AGGAAAAAAAAAAG7CTTT 
A7GAA7C7A7AAGATG7ACT7CTA77GGCATGCC7AA7ACG7C7ATG7A7 
77ACG7G77G7G7ACACAGT7TTTCAC7ACTGAAAA7A7A7AGAGGAG77 
C7AA77AA77GAC7TAAGACAA7AAAAGCGCTTGAA7CAAA7ACC77A7C 
AGGAAAAAGGAAAAGACAAG7CAAA7GC77G77CAAG7C7A7A7AAC77A 
AG7AAAA7C777AA7AAA7AAGC7AGC777AACA77A777GAAA7G7C77 
AAGAA77GCCAGCAGG7TCTGGGT7ACAGAfiC7AG7GGGGG7GCAG7GGG 
G7GAGGG77GG7GGGG7GGGNGG7NNNACNNNNNCNCCCCCCCCCCCCCC 
CCCCCCCCCCCCC7CCCCCCCCGCCCCGNGCGGGCCGCGCCCCCCCCCGC 
CCCCCCGGCCCGCCCCCCGCGGCCCCCCACCCCCCCCCCCCCCCCCCCGC 
GCCCCGCCCCCCCCCCCGCGCCCCCCACCCCCCCGCCCCCCCGCCCCCCC 
CCCCCCCCCCCACCCCCCACACCCGGCCCACACGCACCCCCCACCCCGAC 
GCCCCCGCCCCCCCCCCCCCGCAGCCGACGCCCCCCCCCCGCCCGCCCCG 
CCCCGCACCCCCGACCCCCCCCGCCGCCCCGCCCCCGCCCCCCCCCCCCG 
GCCCCCCCCCCGCCGGCGCGGCGCCCCACCCCCCCCCCCCAGCCCCGACC 
GCGCGCCCCCCCCACCCCCCCCCCAGCCCCCGCCCCCCGCCCCGACCC 
>Contig20

GGCAG7ACGC7A7AA77CCC7C77CACC77ACC7CA7C7G77C7C7GA7G 
GA7G7AC777777777TAGT77C7AAA77CCC7777CC777GC7C7GGAG 
A7GGG7GA77GA7G7AG7CTGGG7A777G77CCC7CCAAA7C7CA7G77G 
AAA7G7AA7CCCCAG7G77GGAGG7AGGGCC7GG7GGGAGG7G777GGA7 



FIG. 3 (8 of 52) 






WO 99/06426 



PCT/US98/16102 



catgggggcagatccc - jv7gaa7agc jggtactgtcctctcaatag _ j: 
aa7Gagt7C7CC7gaga7A7gg77G777aaaag7G7G7ggcac7ccccca' 

TTGCTCTCTTGTTACTGCTTTCGACATGTGACATCCCTGCTCCCCTTCGC 

TCTCTGCCATGATTGAAAGTTTCCTAAGGCTTCGCCAAAAGCTGAGCAGA 

TGTGGGTGCCATGCTTGTACAGCCTGCAGAACTGTGAGCCAAAATAAACT 

TCATTTCCATATAAATTACCCAGCCTCAGATATTTCTTTATAGCAACATA 

AGAGTGGCTTAATACAGGCTGGGCATGGTGGCTCACGCCTGTAATCCCAG 

CAC7G7GGGAGGC7GAGGGGGG7GGAACA7GAGG7CAGGAGA77GAGACC 

ACCGGCTAACACGGTGAAACTCCATCTCTACTAAAAATACAAAAAATTAG 

TCGGGCGTGGTGGTGGGCGCCTGTAGTCCCAGCTACTCTGGAGGCTGAGG 

CAGGAGAATGGCATGAACCCGGGAAGCGGAGCTTGCAGTGAGCCGAGATT 

GCACCACTGCACTCCAGCCTGGGCGACAAGAGTGAAACTCCATTTAAAAA 

GAAAAAACAAAATTTCAAACAGAACAAAATGAAAAAAATACCAAGTGAAA 

GGCCCCTATAAAAACCCCTCTGGGGCCCATCCTCCCACCCCCTCAAGTGA 

AACCACATTTAACAATTTGGTGCATATCTTTCCAAACCTTTTGTTGTACA 

CATATAAAAAACATACATGCTTTGATTTGGCTCAGACTGTACATAGTGTT 

TTCCCTCTTGCATTTTACACTTAATATATCTTTGACATCTTTC7ATGTCA 

GTGCATGTTGGCTCGATGATATTCTATCATTAAATACCCTTCCAAAAATG 

G7AAAA7CA7777AAAAAA7CA77CACACAAG7ACA7A777ACAA77T7A 

AAAGAAAACAGAATCCCAAAACACAACGACAAACCTCTAAAAATAATCTC 

7AXCTTTCCACCAGCATGGAACAGTTCATTCGTTTTTCACATAAAACGAA " 

TTATGTGATTGGAAAGATTAACTCTAATCTACACATTTATATACAGAATG 

T7C7A777G77AAGCC7A7C7GAAAA7AAAAAA77CAGA7GA77AA77CA 

C77ACAC77AGAAA77AAG7CAA7A7AC7A7GAA7ACACA77G7GA7CAG 

77A7AA7A7GA7GC77C77AG7C7AGGG777CAA77AAA7AACAG7AAAA 

AAAATTGGATAAATAAGACAGCTAATAACTGAAAAATCCAGAAATTCAAA 

GATTATATTGCCAACTAAAACACTGCCATTTACATTTTTT7TTCCTACTT 

GGTAGCAAATGCTAATGGAATTCAATCCTGATTACTTAAAGTCAGTTCAC 

ATCACACATTCAATCAGGATAATACGAACATAATATGCCTACTATAGCGT 

TAGATTAAGACATAAAATTTTTTTGCTTGAAAGTAATGACTGCGTACCAC 

TTGAGACATTTGTCAACCACTTCAGCACATTGTTTACGAGTGACTGGATG 

TCCACAAGGAATAAAAACGACAGCAATATTTCTATCCATACAGATTTTGC 

AAAGCTTCTCCTCTTGCAGGTGTCTTAGCTGCTCTTCAGTACTAATCTCT 

TTCTGCAATGAAGTCTGACTTGATTCGTCTTGTGTACTGTCTTTCTGAGC 

CTTCACTGGATCTGCAATCAGAACCTCAAGTGATTTACAGTTGCTCCCAG 

ATGTCTGAATTTTTTCC7CCATTATTTTCTTAATGTCTTTGAAACTGAAC 

CCCATTCATATAGCTTCTTGTACCATAGGATTATGGAAGATGGTATCAAT 

TTTTC7AGT7AGTGATGGCGTTTTTTCAGCAG7TCTTACCAGACACTCCT 

CAAG7GAA7GGGA7AAA7GAA7A77G777A7A7A7777CG7G7C77C7G7 

7C7AACAGA7A777ACACCC7GGA7GCCA77AACA7G77G7CCCAAGGG7 

C77NC7GGGC7 

>Contig21

C7T7C7CCC77777ACCCCCA7777CG7AGGGA777GG77AAAACCCA7G 
7AAAAAA7CCAAACACCGGCGGGGAACGGGGG77CAAGC7CG7A7CCCCA 
CCAC777GGGAACCCAAGG7GGCAGGA77G7CGGAAGCCAGGCA777GAG 
CCCACCC77GGGAAAAAAAAGAGAACCCCCA77T77777GAACAAAAACC 
CCAACCC7CCCAGGAAAGAAA7AAG7A7GGC7GGG77GAAG7CACCAAAG 
A7GGCCGAC7GGC7GG7CAAG7AAC777ACC7GA7GG77CG7AGAA7A77 
7ACC77CACCCAGG7GGGAGAA77GC77GAGCCAACCC7CAG7G7GGA77 
CAGGAAC77GA777AA77GG7A7CG7GA77G7GGA77AGA77C7CAGGGA 
7GCA77CAC7AAG7AAAAG7GA7AA7AGC7AC7777AAG7AAAA7AA7GA 
A7GAA7CAAACAC7C7AAA7CCA7GG7GC7A7GC7AAGC7C777C7G7A7 
777A7C7CA777GA7A77ACAAA7A777GA7G7G77AA7AG7AA7GAC7A 
7C7CCA77777ACAAG7AAGGAAAC7GACA77GAGAGA77AAAAGAC7AG 
CACAAA7CACAAAG7AAA7GAGA777GAA7CCGG7C77GA77CCAAAC7C 
7ACAG7A77C7AAA77CAAGGAGAC7AAA77A7AAGA7GGAGAGCCAA77 
T7AC777A7AACAGGG77AGAA7GGCAGAAGAGACC7GACA77CACACC7 
C7AGCCAG7GCA7CA7C77CC7G7AGGCAAA7A7GCAGGAAA7C7A7AA7 
AAGAACG7CC777GG7GAAGGCCAGG7GCAGGGGC77ACAC77G7AA77C 
CAGCAC777GGGAGG7CAAGG7GGGAGGG7CGC77GA7GACAGGAG777G 
AGAACAGC C7 GGGCAACA'f AG7GAGAC ^ s_ TGTCTCTACAAACAi^A^dA>\

ACACAAAACAACTTCAAGAAAACTCCTTTGGTATGGATCAGAACAAGATG 

AATTATCTATCTGATCCAAATGCTTAATGACATTAAGCCACAGTCCACTC 

ACTGCCACAATAGAGATATACCTGCCAATGCCACTCAGGTAATCCCATCA 

AAAGTGGTAATGAGG7CTGCAGCATGACTTGTTCTTAGTGATCCCAGCCT 

GAGACCTTGAGATTGCAGCATTTTATTCTACATATGCACAAAACATCTGT 

TG&AAAA7C77C7AAA77GA7GCAA7ACA77CG7A7CAAGAA7ACC7G7C 

TGTAATCTCCATAAACCCTCTCC7TTCTGTTTTAAAAAATAGTAACAGCA 

TTTCTCCT7ACATGACAAAGAAATGACTTCACCATCTACGAAATAGTGAA 

TAGGAGCTGTGTGGAAGGAAATTAGCTCTACTTCTTGGTGGAGATGAGAA 

GGGAG7G7TCCTCTGAAAATCAAGGCTCTTGTCATGCTAGGAGCCAAAGT 

CG777777AGAG7G7GGACAG77GAGAAGA7AAGACAGGGACCA7CCAC7 

CA7G77777C77A77CCA7AGGCC7C7C7CAA77GGGCAAAGCAC7CCAG 

ACC7777GGAAGAG7GACACCAAAGGCAAGCACC7GC77GGCAGGCCCC7 

CAGC77C7ACGCAAG7A7AAG7GAG7A7A7AAAA7GGGGG7AC77G7GC7 

GrTGAG7ACC77A777CCAAA7GAGGCC7GCCGG7G7CCC7GTGGC7G7G 

AGAAGGCC7C7AC7GGA7AGG7GGAAG77G7G7G77C7CA7C7777C7AA 

CCC7GGA77GAC77GCCCAAAAGGAAGCCA77A77AACAC7A7AA7AAAA 

CCA7CC77AA7C7GGGAC7C7C77CA7GCAG7GG77C77AACCAG7GA7A 

AACA7GAGAG77AC7777GGAGC77AAAAAAA77AAGA7GC7CAAGG7C7 

ACCCAAAeTGAC7GAA7C7CCAGAGG7GAGGCCCAGi5GA7G7A7AC7777 

GAGCCAGACC7CAG777ACCC7GCAGAGC7CA7AAGG77GCA7AACACCC

777G7CAGCCAC7C7GA7GAAAAGAAAAA77GG7GAGGAA7AAG7777AG 

AGAAGAAGGAGCAAAGG7G77C77GGCCAG7GAGAGCCAA7GACAGGGAA 

A7GCAAACAA7G7A7CCACAAGAAAGG7AAA77ACCC7A7AGAGCA7777 

AGGA7AAA7GAACA7CTCA7GCC7AGGG77GAGAGAGGG7ACAAAAAAAA 

AAAAAAAAAAGACCAC7C7GGA7ACACAACGCGA7AAA7GGAA7AAAGAA 

777777CC77G7AAA77AAAAAAA7CC777G77AC7GAGG7A7AA777AA 

7C7A7777A7G7A7AG77CAA7GAGG7G77A7AGA7AA7AAA777777T7 

G7AAA77A77A7A77G7CA7A7AC7CA7ACA77CA77777AAAAG7CAGA 

AA7G7A7A7AACCA77AAAC77A7AAA7CA77CAG7CA77CAGAGA7A7A 

GA7ACACGAGCA7A7777A7A7CCACCACAA7AA77A77ACCA7C7CAAC 

AAT7CCA7CACCCC7CAAA777CAAGCG7AGGGG77777AAA7G7CAAAG 

GAG7C7AC7CAG7GGGAAGAAAG77AAGGAAAAAACC777GGGGC777GG 

GC7CC77CCCCC7GGGG77AAAAAGGCAGGAAA77GGGC77ACCCCCCC7 

GAAA77GGGAAC7GAAA7777GGGAAG777AAAAAAAAAAAAA 

>Contig22

7CAAGCAGCC77CC77CC77GGC77CCCAAA77G77GGGA77ACAGGCA7 
GAG7CAGGA77CC7GGC77AG777ACA7777C7AGAG7777G7A7AAA7G 
GAAACA7ACAGAA7G7A7777777GCGGAG7GGGGGAG7G777C7A777C 
777C777CCA7777CCCCCCCCCNCCCCCCCGAGACGGAG7C7CGC7C7G 
7C7G77GCCCAGGC7GGAG7GCAG7GG7GCGA7C7CGGC7CACCGCAAGC 
7CCACC7CCCGGG77CAAGCAA77C7CC7GCC7CAGCC7CC7GAG7AGC7 
GGGA77ACAGGCGCCCGCCACCACACC7GGC7AA7777777TG7A77777 
GG7AGAGACGGGG777CACCA7G77AGCCAGGA7GG7C7CGA7C7CC7GA 
CC7CG7GA7C7GCCCGC7TCGGCC7CCC7AAG7GC7GGGA7TACAGGCG7 
GAGCCACCG7GCCCGGCCCAAG7G777C7A777CT7AACCAGC777CA7G 
CAA7C7777777A7777ACCA7C7C7GTGA7CCCAC7CCCAAAGG7AC7A 
GA7G7CGA77GG7CC77AGGATCAGC7ACCA777GCCCAAC7GC777CCA 
GCC77CCAAAAA7777777C77T7777C77AAAGA7AC7CC7G7G7GAGG 
C7CAGAAC7C77GAA77GC7AC7GCAAA7A7GAAC7CGG7GA7G7GAA7G 
CCAGGGAA77GCC7GA77GA7CAAAGAAA7G7A7CCCC7TC7CCC7CAC7 
C77GC7G7C77C7CA777G7777CCCCA7CC77G7GGA77CG7GAA777A 
AA7A7CCC777AA7G77A7AA7A7777AA7GGCG777GGCGAAAAG7ACA 
GAA77AGG7GCAAGAG7GCA7AGC7G77A77777TTT77GGCC7C7GAGA 
C7G77CA7A7A7GCAAG77A777AACAGAAAG77C7GCAG7GACC7GAGA 
7G7CAGGGGGG7C7GA7AGAG7ACG777GAAGGCAG77AC7GGAAAAAAA 
7AA7GCCA77TC7GG777G7AC77CGG7AAG77CAGA7GACCCAA7A7A7 
7G777ACA7G7GGCA77CAG7AAAAAAG7AGC77CCCC7CCC777C77C7 
7CC7777C7CC777CC7GC77C7A7AAAGCA7C7GC777GGGAAAC77C7 
7AGGAGGAGAGC77GCGr*JCCCG7GGL .^TGGAGAGGTCTTGGA(£AGA . 1 
AAAAGAGATGCTCCCACTCAATGCAGGATGGTGTGGAGG7AAATGGGGAT 
ACGTCTGGCATCACTCAGGAATGGGCCTTCCTGGCAGGGAAAAAAAGGGA 
GGGGAAAGAGGAAGGGAA77CNNANA7NAA77GC7GAA7ACGGGGA77CC 
ATGGCCTGGATCCAGGAAGAGAACTTTGGGAGGTGTGAACCTGGAAGGCA 
TCANCTGATGAGGAGCAGCCTGAACTCCGGGGAGGACCTGTTTTTGGTGG 
CCCGGAAAAAAATGCCTTCCACACACAGGGAGGCCACCCGGCTGATGGGC 
7GGGGG77GGACGGACAGCCC7AGGACAGGC77GGGAAACCAGGC7CAGG 
TAGGGCCTGCGAGGTTCTCGCTGCGTCTCTTTCCTTCCTGGTCTTAGAAA 
ATAGAATCCAAGGCCTCTTGAGAGTGGAAGGTGGGTTGGGAGGAGGGCAG 
ATGGGGCTTAGGCCCAGGACACCCGTAGAGCTACTGCCCAGCTGTCTCTC 
AGGGACTCTGCTGAGGTCACTCCAAGGATCATTCTTAGCCTTGCTAGACA 
GTACTGACAGAGGGAACCGTAGTATCGCACCCACTTCCTTCTCTTTCAAT 
GAAAG7TTAAAGGTCACCATTTCCTCTGGCAAAGGAAGTTCCACAAATAT 
TCCATTTCCGGTCTTAGAAACAGCAAGGTATCAAGCAATTGCAAACTTCC 
TGTGCTGGGGAATTCCCAAGGAAGTAGGGGCAGAGTTCTGGTGGAGACAA 
AGTGAATTCCGAGTGATTAGTCAGTAGCAGTAGCAGTAGCAGTAGCAGTA 
GCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGCAGCAGAACC 
AGAATTTCCCCGCACGTGTCTCAGGCTCTCATTTGCCAACTCAGTCTCTA 
AGTATTTTTATTGGCAGGAAAAATAAAATAGCTATGAGTGAAATAATTCA 
TTAGACCTGAGCCTGCATCAATTTTGTGTTTAAAGGCCTGACTCTCTTTA
CC777CCC7GGGA7GGAAGA7GCAAA7G77CC7GA7C7CAC7G7CAAAAA 
AGAAGAACCAGTGGGTATATTGTATGCTTGAGTTCCAGCCATTAGTCACA 
AGACATAGAGATGAC7GCCATGTGTGTAGACTTTCTATAGACTGTGTGCT 
AAACCCGACCTGCCACTTCCAAGGAGTAGATGAGGAATGTCCATGGTTC7 
GGGGAGCCCTACCCCAATTTGGGGCAGACATTCCAAAGCTCATTTTCTGT 
GGAGGGGG77GA7GG77AAAGGAACGGC7GGGA777AC7C77C777C7AG 
GGCCAAGAAAA7GACA7GC7GCC7CCA7G777AA7CA7CC77CCCCC7G7 
7AA7AAC7A7GGC777AAG7CCCCGG77AGGGCC77CC7CCAAAA77GGG 
GAAAAAAA77CCCC7CCCCCCC7AAAAA7777777777AAAAAAACC777 

777AA777AAA777C7CCCCAAAAA777G77777777777CCGCGAAAAA 
AAGACCCCCCCAAAAAAAAAAAG777777GGCGGAAAAAAAAA7A77777 
777G7G77AAGAAA7GGAGAAGAAGGGGGG7777777777C77C7CCCCC 
CACCCGCCAAAGGAAAGG77G77CACAGA77G7777G7G7C7CCCGCCCA 
7 

>Contig23

A7G7GCC7GCGAAA7CA7CC77CCAGAAA7A777GCCCC777C7777G77 
A7AGAG7GGCAC7GCCC7A7A7GG7GACCAC77GCCACA7G7GGC7G77G 
AACAC77GAAA77GGC77G7CAGAA77GCAG7G7AAAG7G7AAAACACA7 
ACCAAA777CAAAGACA7GGCACA7AA7AAAAAA7G7AAAA7A7C7CA77 
AACAA77777A7A77GAC7G7G7AAG7AACA7777GAA7A7A77GGA77A 
AA7ACA7GGA7GA7GCCCCAACACCCACAG7CCC77A7CAAG7C7C7AC7 
7CACA77777G7AC77C7GAC77AGAAA7AGCAC7GGCG7C7AAGAGCC7 
A77AA7G7CG7CAA7AGG7TC77GGGAACCACAA7777AAACAAAA7GAC 
A7A7AAGAAAACGAA7AACA77GAACAAAATGACA77A77CGAGGACC7G 
C7GCA7G77G777CAC77AAAG7CAG7G7CCAAGAAAC7A7CAG7GACA7 
77AG7GAGGAA77GC7G7CC77CC7G777ACAGGAACC7GGGCAAG77AC 
77AA77CC7C7AAGCCCGG777A7A7CCC7GCAAAGAGAGAAGGA7AA7A 
A7CACCAG7AC77AG7GA7G7CG7AAGGAGAAAA7AAAA7AA7AAA7A7G 
AAA7GGC7GACAG7G7CC77G7CACACAGAAGA7G7G7GA7CCACAG7AG 
C7GC7A77G7C7GCC7CAC77CAC7AG7AA7GG7CCAGGGAGGCC777AA 
7G7GCA7GG7GCAG7ACA77CACA7G77GGACA7GGG7GAAGGGAAAGAC 
CAGGC7CA7C7AAACACAA7AGGA7GC77G7GG7G7777GAGGAGGAA7C 
AAGGAC7AG77A7CCACAGC7G7AACA7GCA7GGA7CAAAAGAGA7AAGG 
CACACAAAAGAC777G7CAG7AGCAAAGCA77ACAAAA7GCAGAGACCAG 
C7G7GGG7GG7GG7GAG7CAGACCCAGC77CCC7C7G7GCC7GGC7GAG7 
GG77C7GGGCAAG7CACGCCA7C7G7C77GA7GCCC77CCCCA7C7A7AG 
AGAGGGAGCAAC7GAGGCCCC77CCAA7AC7GAAG7CC777A777C7GC7 
AC777AGAAA7A7CCACA77777GG7AAA77CAAA7GA7CCAA7GA77CC 
ATTTCCTAATGTTCAAAAu f AGCCCCAu ^CATCTAAATGAATEAi^E^ i 

AATAAAATATTTATTGTGTATGTTTTGATTGCTGAAACTTCTATTTTAGC 

AACACACACACACACACACAGAACCCATAAGCCTTCATCTTTCCTTGGAT 

AAACGAGCCTTCCTGTCTGGCCATTTAAGTCACGATTAAGTAAATGATTT 

CCAACTCGCCTTTTGCAGCAGTTCAGATGGGTCTTTCCTGCGTGGCAGTG 

GCCCTCCTGACTTATGATTTCCTGTGTGTCGGCCTGTTACCACTGCAGCT 

TAACTGAGGAAACAAGAACAAAACAGCCTCTGACCCCAAGAGACTGTTGG 

AG&CAAAGGCTTCAGTCCCAAGAACCTCACACGTGGGGAGCCCGAGAGCC 

CAGCCCTGACCTTTTCTCCAGTAATAACATAAGAAACAACAGGCACTGGC 

CTTATTTTGGATACAAAGAGTGGTGCTTTTCCTTAAATCTTCCTTTAGTC 

AGGGGTACCCCTTCATGGACGCCCCAACATCCATGGTTCCTGCTTGAGTC 

CCTGCTTCCATATTCCTGCACTTCTCACTTGAAATATCCCTGGAGTACGT 

TAAGCAGCCAGGTTTGGAAGTTCTTGCTGTGCAGGCGGGTGTGTGCATGT 

CCTCTCTCTCAACAGGACACAAGCTCCCCAAATCAGACGGTATGCCTCCA 

CGCCCCTTCCCAAGCCTCCCCAGCAGCACCGAGCATGTGAGGGGAGCTGG 

GGCCCAGGCCATGATGGGAAGCACTCTCTGCCTAAAGACTAGGGTGATGC 

GCCCTCAACTGTGGGAATGAGCCCCAGCTCTGGTGTCTGCCTCGGTTTTT 

CCTCCTGGACAATCAACATGAACTCCTCACCCCTCTTATCCACTTTGCAT 

AAACTGAAAATAACAAACCCAGGGTCTTTCTGTCACAGGAAAGGGTTTTT 

TTTTATAAGATTAAACAGAGATGATTCAACACACCCAGGATATAACACAT

GGGCCATGAGTCAAGGCCAGGCATTGCTCTGGTCAGCCTGTTGTTTGGGC

CCCCTTGGCAGGGCTCTCCCCTGAATCTTCCCCCTCTTGACTCCCCATCA 

CCACAGCACG7CCAGCTTTGGGTACAAGGCCAGTAAATGGGGAAGGGGGT 

CAGATGACATAAAGAGCCCTTTCCTGTCCCATTGAAATATATTTGGATAA 

CAGATGGCATTTCCCCCTGTGTCTTGCCCAGGGCCCAGAGCCTCCACTTG 

CTAGAGGCAGACAGAGGATGGAGAGCCCCTTCATTAGTGGGAGGACATCA 

CAGGTGGGCAAGAAACCACAAGCTTGCACTGAGGCCCAGCCTTGAAATAG 

CAGCACCTGCCGGCACCTGTGGTCTGGGGACAGGGTCACAGGATGGAGGG 

GCCTCCTAAGCCTTTTATCTCTATGTACTAAGTACAACCCATTTTCCCAC 

CTCACAGAGCCAGATCAGCCTCTGTGAGGTCCTGGTGGCAAAAGGATAAT 

TGCCTGCCCGCCTGCCCGCGGTGGGGTGCTTGTGCTTGCATTCCTGGGAA 

GGTTGTTGGGTTACTCTGCAATAGGTCTCTCTGACCAGCTCACCCTCCTA 

CTGCAAACCTCAAACCAACTTCAAAGAAGATCCAGCACC 

>Contig24 

CGCGTAGTCTAAAGACTGAGTCTGAAGCTGTCCCTTCCTGCTATGGACTT 

CAGATTTTAGCCCACTTGAATTGCTCCATATCCTCCAAGCCATGGCCATC 

CCTTGACTCTCTGGGCTCCCAAGCACTTGCTGCCTTCATCACACAGTTTG 

AGTTAAGGCAGAAAGACTGGTTTCCATGTACACTTTGTGGAAGCTT7CTC 

ATTTCTTTATATAATCTCTGTCCTTTGTCTACTGCTTTAAAATCTAGAAA 

TTGTTTACAAACACAAAGGTGATCCTTTAAAAGCTCAAAGCTGATTGTGT 

CACCAATATATACCACTCTTAATGGCTTCCCATTAAACTTTGAGTAAAGA 

CTTTATGGAGCCTACATAAGGCCATGACTACCTGGCTCTTATTTTCCTCC 

TCATCCTCATCTCACCAACTCACTCTCCACTCCTATACCCCTCACTCCTT 

CCCCCTCCTCTCTCTGAGCTCCAGACTCCCAATTACCTACTTCCACCCTT 

TTTGACCCCCAGGGACTTATCTCAGCCTGGAATTTTCCCTCTTTGCTCTC 

CACTGAACTGTCCACTCCCAGTCTAAGACATGTGCTTATGTCACACGCCC 

TTACCGTGCTTATCTCAGTTTGTAATTATCTACTCATTTAGAAAAGTGTT 

GATGAAGGTCTTCACTGTCAGCTTTCAGGATAGCAGGAATCATAGCTGAT 

TTTACTTACTTAACGGGGTTTCATTCTTTGTAACTTTTTTTTTTTTTGAG 

ATGGAGACTCACTCTTGCCCAGGCTGGAGTGCAATGGCATGATCTCGGCT 

CACTGCAACCTCCACCTCCTGGGTTCAAGTGATTCTCCTGCTTCAGCCTC 

CCGAGTAGCTGGGATTACAGATGCCTGTCACCACGCCCAGCTAATTTTTT 

GTATTTTTTGTAAAGACGGGGTTTCATCATGTTGGCCAGGCTGGTCTCGA 

TCTCCTGACCTCAGGCGATCCACCCACCTCAGCCTCCCAAAGTGCTGTGA 

TTACAGGCATGAGCCACGGCACCCAGCCACTCCTTTTTTACTTATGGGTG 

AGAAGCCATTAGAGATCATTTCTTCTTTTCTTTCTCTCTTCACTAAGGCA 

CCAGGGTCACTAAGTAGTAGGATACTTTGAACTAGAACTCAAGAAATTGA 

GTTTTAATTTTACCTCACACTCTCATATGAATTCTCCATGTGACCTCGGG 

CCATACTTCCCCTGTACCCTGTTTCCTCTTTTATAAAAGTAAGAGTTTAA 

ACTAGATGGTCTCCGACATGCATCCTTCTCTAACATATTCTGGAACCTTC 
AATAAAC7AAGA7AAAC, .tGAATAATTAAAACTTAATTTAAAAGAAGAw 

GGAAAGGAAGCAGTTACATTAAGCAAAAGAGACATCTTCATGGTTGAA^ 

AGTGTATGCCCTGGTGTCTGGATCCCATTTAGGAAACTTGGTAACCT7GC 

AATCTTGGGCAGATTGCTTAATTTCTCTAGACCATGACTTCCTCTTCTGT 

AAGATGTGATAAGAACATCTACCTCACAGGTTTCATGAGAGGATTAAATG 

AGATAATGTATTATAATCCCTTGAACATGGTAGGCTGTTATGTTAAGTCC 

TTTCCTCCT7CTCTGTAGCTATCATGGAATTTAAAAACACATTATAACTA 

GAGCATGAGTTGCGACTAAAGGCTCAATTGTCTCTGCATGTGTTGGCTCA 

TGCATGCTTTATTCCTCTGAAGAGCTTTTATACCAAGTGAAAGGAAATAA 

TTGCATTTCCCTGAAAATTCACAGGAAAAAGTTATGTTTTTCTCTTCATT 

CAAG7GATTCTGTTAGACCCAACCACATGCAACAATTTTAAAGTTGCTTC 

CAAATATATTTACAAATATTTCCTGTCTTCAAGGAACAATGGCAAGACCA 

TGACTCAGGTTCACATCCGGATTCCACCACTAACCATGTACCCAATTACT 

TCAGTCACCTTCATTCAGGTCTTACATATCACAGAATAAAATCAGATTTC 

ATCAGAGGAGGTGAAGACAGGGAGAGGAGATATTTCAATCCCTTCTCCGC 

AACCCCCGTTTTTTTTTTTTTTTTAACAAGGATCCTAGAGTTACTGAATG 

ATAGCACGTTTGAGGGGGAAAGACCCTAAGGATGATCTTTATAAGCCATC 

ACTTGGTGTTGGTGGTGATAAAAAACTCGAGTATCTTTATGCAGTGGAAA 

GAGAAGATTGGACTCGGAATCAGAAGCTTGAGTTCAAGCACTGGTTTCAT 

CAGTCTTGTGATCTTGGGTTGGTCACTTAACCTCTTCAAGGGTCCTCAGC 

TGTGAAAGAAGATAGTATCAGCTAATTGTTGTATGTGCAGTGAGGAGGCA

GTGAGATAGTGCAGGTAAACTATAAAACAATTGTCACATGAAACGCATCA 

CAGTGATTCTTTGGACCCACAAGCTCCAATCTTATAAAACATATCCAGTC 

ACCCACCAACATAGATCATCTCACCTTGCATATCTGATTTTGTGGATCAT 

GGGGAAAAACTGCTGATTCCTAGCAAAACCCATGGCATAGGATAAGTGCA 

CAATAATTTTTTTTTCCTAAATGATTTAGATGACAGTGACTCATTAAGGG 

TTTCCTGAGGCCTCCTCAGAGTCGAGAGGTGGGTGCCTGAAGCCACCCAA 

AGTCCCTGTCACAGGATGGCTCCCAACGCACACACCACAGGCCTGCCCAG 

TATGTTCCACTATCTACCCAGTAGAGCCCTGCCCAGTACGTTCCACTGTC 

CCTTCCCTAGAAGAGGTGACTGTTGTTCACAGTCCCAGAAAAGCGGGCTC 

CCCAAAACAATGCAAGGACCCACCTCTCTCTGAACCTCACCCACCCTAGT 

TTTCCTTTAAAAATCAATTTACAAGAAGATCATGTGAAGGAAAAGGTTGG 

GTGATATTCTAACCCAAGTTAGCTGTTTCTCAACCAAGTTCTCTTTGAAA 

AATTCAACAACCACCTTTGGGGAATTATTTACAACAGAGGAGTGAGGATG 



GATTACCAAAGAGATGGTATGCATTGCTCCCAGAAGCTAAATATCTTCAG 
GCTTTCAATGGTGGCCTTCACCTGAAAATGTTATCCCTGTTGAAGCTTTC 
AAGCCAG7ATTTTCATAAGAACTATATTTTCTTTGGTGAACTGAGGCATT 
ATAATGATGAC7ATACAGGTTCTTGAGTGACTGAAGCCATCATTAGCATT 
37CA77A77777G777AG77GCA7C7CCA7AGCAGC7CACA77CACAA7G 
7GC777GCAA77G77CC77AGCAA7AGCCCTCACAAGA77C7CAGGAGGA 
GAGGG77AA7CCGGA77AACA7T7C7G7GAAGCC7AGCGAGA77AATCGC 

>Contig25

AAGAG7777AAAA77AAG7AAGGACGCCGGGAAACAAA7CAA7CCCAGCA 
AACA7777G77GGGA777A7CA77CAAGCAA7777ACAG77A7CCC7G7C 
AAA7ACA7TAAG7G77CAAAA77GGGCA7AGGGGGAACAAAA7AA7AAAC 
CCAGCCAAAACAGAA7AA7CCC7GTTTG7TCAA7G77GGA7AAAAAAGAC 
ATTAC7A77GG7G7AAGGAAA77AGA7ACA7C77CCA7TA777AG7AAAA 
7TACCA7AAC77C7AACTT7G7GGCTTTAGGCAG7CTAGTCCACAGGCAG 
GAAGGAGG777G7777GGCAAA7GAC7G77A7CA7C77C7G777CAAAGC 
7AAACCA7AAAC7AAG77CC7CCCAAAG77AA77CAGCA7A7GCCCAGGA 
A7GAACAAGGACAGCC7GGACG77AGAAGCAAAA7GGAG7CAGG7AGG7C 
AGA7C77C77CAC7G7C7CAG7GA7GGCAG777CA7AACTT7AAA7GA7G 
GC7A7CACAG7777CA7AAA7AATC7AGA7AAACAGT7AAAATAAAATAA 
T7AGG7AAA7G7AG7GCGA7AAA7A77AG7AGACAAAC7CACCA7AA777 
AGAA7C7AAAG77AAA77AAA7AA7AA7A777CATTA777GG7A777TCC 
AAGAAAAACA7A77G7AGGAAACCA77C77777AAAAAAAAAAG7G7CCT 
T77AAAAAGG7GAA7AA77777G7C7AA77CAAAG777A77GAAAAG77A 
7G7A7AAAACAAGG7AAAAGGAACAAGGAAA7AAGGGAAA7G7AAAGAAA 




TG 
ATTATAGAAATAAAGTGw rATTTTTTGGTAAGAAAGCTTAAAGftGASA*^ 

ATTTTAGGTAAGAAAGAATCTTACCTAAAATTTTGTGCTAGAATAAAGTG 

ACTGGCTAAGAAAGGGATGTTCAAAGCTATTTATGACAAACCCACAGCCA 

ATATCATACTGAATGGGCAAAAGCTGGAAACATTCCCTTTGAGAACTGGC 

ACAAGACAAGGATGTCCTCTCTCACCACTCCTATTCAACATAGTATCGGA 

AGTTCTGGCCAGGGCAATCAAGCAAGAGAAAGAAATAAAGGGTATTCAAA 

TAGGAAGAGAGGAAGTCAAATTTTCTCCGTTTGCAGATGCATGATTGCAT 

ATTTAGAAAACCCCATCATTTCAGCCCCAAAACTCCTTAAGCTGATAAGC

AACTTCAGCAAAGTCTCAGGATACAAAATCAATGTGCAAAAATCACAGGC 

ATTCCTATACACCAATAATAGACTAACAGAGAGCCAAATCATGAGTGAAC 

TCCCATTCACAATTGCTACAAAGAGAATAAAATACCTGGGAATACAACTT 

ACAATGGACATGAAAGACCTTTTCAGGGTGAACTGCAAACCACTGCTCAA 

GGAAATAAGAGAGGAAACAAGCAAATGGAAAAACATTCCATGCTTATGGA 

TAGGAAGAATCAATATCGTGAAAATGGCCATACTGCCCAAGTAATTTATA 

GATTCAATGCTATCCCCATCAAGCTACCATTGACTTTCTTCACAGAATTA 

GAAAAAACTAATAGCCAAGACAATCCTAAGCAAAAAGAACA^AGCTGGAG 

GCATTGTGCTACCTGACTTCAAACTATACTACAAGGCTGCAGTAACCAAA 

ACAGCATGGTACTGGTACCAAAACAGATATATAGACCAAAAGAACAGAAC 

AGAGGCCTCAGATATAACACCACACATCTACAACCATCTGATCTTTGACA 

AACCTAACAAAAATAAGCAATGGGGAAAATAATTCCCTATTTAATAAATG 

ATGTTGGGAAAACTGGTTAGCCATATGCTGAAAACTGAAACTGGACCCCT 

TCCTTACAACTTATACAAAAATCAACTCAAGATGGATTAAAGATTTAAAC 

ATGGCTGGGCATGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCC 

GAGATGGGTGGATCATGAGGTCAGGAGATGGAGACCATCCTGACTAACAC 

AGTGAAACCCTGTCTCTACTAAAAAATACAAAAAATTAGCTGGGCATGGT 

GGTGGGCGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGG 

TGTGAAACCAGGAGGTGGAGCTTGCAGGGAGTGGAGATCACGCCACTGCA 

CTCCAGCCTGGGCAACAGAGTAAGACTCCATCTCAAAAAAAAAAAAAAAA 

AAAAAAAGAAGGATTTAAACATAAGACCTAAAACCATAAAAACCATAGAA

GAAAACCTAGGCAATACCATTCAGGACATAGGCATGAGCAAAGACTTCAT 

GATTAGAACACCAAAAGCAATTGCAACAAAAGCCAATTGACAAATGGGAT 

CTAATTAAACTGAAGAGCTTCTGCACAGCAAAAGAAACTATTGTCAGAGT

GAACAGGCAACCTACAGAATAGGAGAAAATTTTTTCAATCTATCCATCTG 

ACAAAGGGCTAATATCCAGAATCTACAAGGAATTTAAACAAATTTGCAAG 

aaaaaaaaacccatcaa; ,-JVGTGGGCAAAAGATATGAACAGACACATCTC 

AGAAGAAGACATTTATGTGGCCAACAAACATGAAAAAAAGCTCATCATCA

CTGGTCATTAGAGAAATGCAAATTGAAACCACAATGAGATACCATCTCAT 

GCCAGTTAGAATGGCGATTATTAAAAAGTCAGGAAACAACAGATGCTGGA 

GAGGATGTGGAGAAATAGGAATGCTTTTACACTGTTGGTGGGAGTGTCAG 

TTAG7TCAACCATTGTGGAAGACAGTGTGGCAATTCCTCAAGGATCTGGA 

ACCAGAAATACCATTTGACCCAGCAATCCCATTACTGGGTATATACCTAA 

AGGATTAGAAATCATTCTATTGTAAAGACACATGCACATGTATGTTTATT 

GCAGCACTATTCACAATAGCAAAGACTTGGGAACAACCCTAATGCCCACC 

AATGATAGACTGTGTAAAAAAATGTGGACGTATACCCCATGGAATACTAT 

GCAGCCATAAAAAAGAATGAGTTCATTCTTTTGCACGGAACTGGATGAAG 

CTGGAAGCCATCATTCTCAGCAAACTAACfiCAGGAACAGAAAACCAAACA 

CTGCATGTTCTCACTCATAAGTGGGAGTTGAACAATGAGAACACATGGAC 

ACAGGGAGGGGAATGTCACACACCAGGGCCTGTCAGGAGGTGGGGGGCAA 

GGGGAGGGATAACATTAGGAAAAATACCTAATATAGATGACGGGTTAATG 

GGTGCAGCAAACCACCATGGCACATGTACACCTACGTAATAAACCTCCAT 

GTTCTTCACATGTATCCCAGAACGTAAAGTAAAATTTAAAAAAGAAAGAA 

AGAAAGAAAAGGATGTTCACGACAAACCAGAAAGTCCAAGCATGTCATGA 

ATAGTCTGTGTAAGTCACAATAAGAGGATTTATTTAAAAAAACTTTTATA 

TGATAAAGTTGTCTATAATTAAAGGGAAATTATAATGGTCTTTCTAGAGA 

TTGGGTTGATGTTAAAAAACTACTTATATATTAAAAAATTGGTTAGAACA 

ATGAAATTTTCTTACGGGGTTGATTCACTCTTAATAAATTATAAGAGACT 



TTAGGTTTTCTCTCCCCTTTAAAAGGGTGGGAAATAGTAATGCCCTCCTT 
CAACTCCCTTCAGCTCATATACGTTTTTTACCCTCAGATTCTGTTTGTTG 
TGTCCTGATGCTAACAATGTTTTCTTAAAGGTCTAAAGGAAATGTTTTCT 
T C CAACATAAT ATTC TGI ^CATTGCAGAAGGTCTTTTCTTTTGCCTTTTb 1 - 
GTAACTGGCTTAACAGATTTTATG7TTTATTGAAATAAT7TCTATGCCAT 
TATTATTAAGTTTTGGTTTGCTTAGAAAACACTGAGATTAATACAATTTT 
77AAAAATTATGAT7A7TACA7CCATATATCTTTATGTATGTGC77T7AA
AGTCCTTGTGACATTGAGTTCTAGGGCTTGACTCCTGGGTCTTAAAAGGA 
CAAGTCCTGCTAAATCTTAAATACTGACAGCAATTAAAGGCTCATCTTCA 
GG^CTGGTAGAAAATGCCAATCAAAATAAACTGCATTCTTGAAACACAGA 
GCCAGAAAT7AAAGC7ATTCAAC7CAAGGCCCAGGAACTATAG7GGAAGA 
GGTGGGTGTG7GAGATTGTAAGGGCCAATTTTGAGAGATAAAATAAGTTC 
AATTTCTCTATAAATTAATCATAATCATTGATGTCCAAGCCACACTGATG 
CAAGA7CAGCATATGGGTCCTGTGTCAGATTAACAAGGTTTTCTTGAAGC 
ATTAACCTACTCCTTAATAAAGG7TATAGAGGTTATAAAAGGCTTCTGGA 
AG77A7AGC7A7GG7CAAGA7AAAAAT77CATAGATTGTTAA7ACAATTT 
7GGAAAACAAATTTAA7TGGCT7CTTGCTGT7TTTATTAGGGCTTATTGT 
7TGGAAAA7TAAGTCTCGTC7CTCAAAGAA7GAAGGCT77CACCT7TTTT 
TT777777777TAA7CCTTGAG7TATCACTT7GG7CAAATGAA7GACTTA 
TTT7ACAA7GACCTTTCATCAAG7G7777AAACC77TCAAAT77GACAAA 
CT77CCAAAA7CAAACTACAAA77A7GTC77T77A7GACCTAA7GAA7CC 
TT7AAAA7AC7AGG77CCC7AAAG7CCAAAAAAAAAAAAAAATAACATAA 
7G7GGC77A77TGGTATAAAAA777TACAAGAAACA77G7CAAATATAAA 
ATA77G7G7GGT777G7TTGGGC7G7A777G7A7AAA7A7G7TAT7GG7A
TG7G77CCAAAATTA7AGGAAAC7CC7ATAA77C7GATA7GAC77GG7GT 
ACA7TA7CAG7AATAA77A7AA77G77A7GG7AAA77A7TG7G7GCCA7G 
GAGG7AACAAA7TTCC7CA7CAAG7G7G7C777GAC7A7GG7TGCCC7AA 
AACT7777GCCATTCACAGACAA77G7CT7GCTTTGGTCC7CT7TAGAAG 
GTGG7777ATAA7CAGC7ATAAAAC7C7AACGGGTGCTCT7GAA7GCAGG 
CT7AAGA7AGC7T7GGAGACTG7GACATCAGAA7AGAGGAAAAAC7TTCA 
GTAT7CA7GGAGTGC7GAAATA77CA7GAA7A7CAAGCAAAACAGGAATT 
AACT7CA7AGA7GGAACTAAAAGAA7GCTGAAG7AATCTTTTTGACTTT7 
TT7CT7AGAATGTTGATCC7TCG7T77GTT777CAGAGTCNAGGAAATT7 
TTC7G77GAGA7A77GACAGC7T7AACAA77AAG7A7ACTCCAGTGAACA 
CAA777GGAGCA 
>Contig26 

A7C7AG7CA77CCCCAGCCTGACCAAT7CAA7GGCCCCCATC7TAG7TAA 
AATTCC7CACCC7GACAAGGCCCCATC7ACGCC7CTGACCTCATGCCC7C 
CAC7C7CAG7C7TGCACTCACCC7GCCACACTCAAGGGCTTCCCCAGG77 
CC77C77AGA77CCACCGA7AGC7CAGGGAC777GCACATGCTACGG7C7 
CTGCC7GGC7CC7CCCCAGATC77C7CATGCC7AGCTGCTTC7CA7CAGC 
ACCCC7CAGAGAC7GTCCCTGCCCCACC7C7CCAGG7TCCATACC7GCCA 
CCC7CCCCCAATCACG7AACAG7TTC77CACAGAGCGAGTTACCA7CCCA 
G7A777CCC7AACT7ATT77T7G7GAC7GG7C7G7TGCCTG7CTCCACCA 
CAAGAACA7AAGCTGCATG7GAACAGGAGCC7TG7C7ATCTTG7CACCCC 
AGTGGC7G7GACA7AACCTGATACACA77AGATGCTCAATGATGTT7GAT 
GAA7GAAG7GCTGG7AGTCCAAC7GTG7TTCC7TGTCTGTGTAAGTATG7 
CTG77G7GG7T7CC7AAGAACC7ACAGCTCTCCCACTG7GACTCCTGT7C 
7A7GG7CC7GATTTGCTGGAC7AGAATCCTAACC7ACATGCTTACTCT7A 
G7GTCC7CCCCCAGAGGCTGAATCCCAGTCCCTAAACC7CCACCAAA7GG 
C7AAGACC7AGC7TCCAACCAGACAGGCC7ACGCTGAGACCTCAGCACCG 
CCC77C7GCGG7C7CATCCTTAACGCATCC77CAGGGCCCAGCTTAAA7G 
7CTC77C7CCAAGGAAGGC7A7CCTC7T7CTGCCCCTCAG7GCTCTCCAT 
GCC7CC7C7A7GCCTCCA7GCC7GC77TCAACCC7GCAGAAGTGGAGAAA 
TTGC7AA7C7GC7G7GT7GACAC7G7GC7GGGG7GCCT7GGGCCAGGGAG 
CAGGC7GG7GG7G7GC7GA7AGCCCG7GGC7GTGCCCAGGTCCATGCTCA 
CT7CC7GAGCCCCAGTGGAGTAGGC7CCC7TTCCC77A7TGCAGCACTCA 
GAGGAAGGACG7GCT7C77AGGACAGATC7GGCCAACC7C7CCC7CGTGA 
GAGAAGGCCCAGCCATCCTCT7GCCC7C777C777C7CCTGCCCCCGAG7 
AATAAAGG7GCC7GG7CAGAGCCTTCTAGAAGGAGACCCAAACA7CCACC 
ACACAT7CCCAG7TCCAACCGTCA7CCACA7GGC7GGC7GTGCAGGTAAA 
CGCAGAG7C7G7T7CACACACCCAACCATC7AG7AT7GGA7GGGAGGACA 
G7AGCG7GACAC7C77CTCCAGCC77GAGCCC7ACTG7GGGCCCCACCCA 
A.CCCAGATACCAGAGGrtJCCCTGTACTGGGATGC7ATTGGATGG^G'iw€- 

AG7CA7G7ACAAAG77AGCCCTTTG77A7A7AGAG77AGC7ACG7ACA7C 

TTCCTCTGTAGGGAACCCAAGAGGGGAGAAGAGATATGTAGTAGGATTTA 

ACCTGCAAATCCTCTGCTGAGCACCCTGCACTACATACAGTGGGTAGCAT 

GTGG7AGGTGCTCAATAACTATTGACCGATAGATTGAATACAGGTAGGAT 

GGTGACACAATCTAAGATCCCAGGGGTGGGGAGACCACACGCTTGGTTAG 

GGAGACCCAAAGTGGACCGTGTGGCCAGAAGAGTCCCGCACTGCACTCTA 

GTGACAG7GCAGAAAGTCACTGTGGGAAATCTAGAAGTTTCTACAGGTTG 

CTATTTCATCATAGCACTGTGCAGGCCAACCCTTCCTGCTCCACTGGCTG 

TTGGGAAAAGCTTTCTCTTTTCTTCCTAGCCAGGGAGCTCTCAAAGTGTT 

CCAC7C7C7CACCTCCACCCAGGCG7CCAGG7G7GGAGGACAC77GCCGG 

CTGCTTGTCTGCTGACTCATCCCTTGGTTTCACTTGGAAAACCTACCACC 

AGCTGGCCTCTTTCCAAGCATCAGCCTCCTCATTTTCTTAATCCCTTAGG 

TGTGATCTCACCTCCACACAGTAGATTGCCTCAAGGCCCAATTCCAATAT 

GAATAAAAATGATTATTTTGTCATCTTCCAATCTTCCTTTTAAAATATTA

TTTTATAATTCCCTTTAGGAGGATCACCTAAGTGAAGACTATTTTTACCT

AAGAAATGTTAAAATGTAAAGACATGGTTGTAATCTGGGGATTCCTGTTA 

AAATGGCTAGCAGACAGAAGTCAGACGACAGGCTAGAAATGTGTGAAGAG 

TGGTTGCCTTTGAAAGGCGGAGTTGGTAATGATTTTCTTCCATTTTTCCA 

TGCTTTCCAATTCTCTACAAAGGCCTTAATATTACTTCGATAACCAGGAC 

CTCTGATAACCTGCCCCCACCGAGTAAAGACTTAGCfTGGGAAAGTCAGCT 

TCATGTGAGGTAAAAGGAACCAGGTAATACACAATTCCCACTGCCAACTG 

TCGGGTGTGCAGGCCTGAGCTTCCTGCATGTGGGAGGAAAGAGAAAGAAG 

AGAGAAACTCCAAGATCCAAGAGATCCAGCAAGAAGGCTGGAGTCTGAGG 

ACGCAGAAAGCTGAATGGCACAGTTACCACTATTGTGCTGAGGTTCTGTG 

GCCTCTGGGTCTCTTGACAACTGGGCAAAGACCCACAGAAAACTATCTCT 

AGACCCTACCTGTGGGAGGGGAAAGTGCTTAAGATCATTTACAGGACAGC

CACCTGGACCTCAAATGGCTTACAGTTCCTTCATCCAGAGGGTCTTCATT 

TAGTACATACCAGGTGCTAAGCTGGGTGCTGGAGACATGACGGGGAACCC 

ATTTACCATGGCTTTGTTACTGTGACATTCACATCTAGGGAAAGCCAGCA 

AAGGGGAGGGATCGAGGAGAGCTTGTTAGGCAGAGAAAATACCCAAGGGC 

AAGGGAGAAGCCAGCCTGTTCTGAGCACACACAGTGG7TCCATCTAACTG 

GGCCTCAGTGCCAGGTTGGACTGGAGATGGGGCTGAGGAGCTGTCACAGA 

GCATTCTGGACACAGATGTCACATAGTCCCTTGAGGTTAGGGTCCTTAGG 

CATGGCAGCATTGCTTTGAGTTTTTCCTTTTGTAATGTTGCCATTCATGA 

CAATG7GGAAGATGGGTCCTTGCAGAGAAGGGCAGGGCTGTGAGACCAGT 

TAGGAGACTAAGATGTGAGCCAAGGAAAATGAGGAACACCTGAACACTGG 

GGCAGGTGCAGGGCCCAGAGAGAAGCAGATGGCTTCCTGAGGTTTTAAG7 

AGG7AGAA7CAAGGCAGC7GG7ACAGA7C7777A77ACA7A7AAAC7GGA 

A7AAGCCA7C7G7TCCAAGACAAAAGAG7AGGCGGAAAACAA7ACAAGAC

AGAAA7GGAA77AGAACAAACC7GGGAGGAA7G7GGAA77AGAG7AGAGA 

G7CCAACAC7GGCTGCAA7CATAAAAATGTAAAACAAACAAAAATT7GC7 

AGG7G7GC77ACTTAGAAA7AA77AGC7GTCA7A77AAG77CAC77GTG7 

7A7GGC77AAA7GTGTCCCCCAAAA7G7GA7G7G77GGAAAC7TGA7CCC 

CAA7GCAACAGAGTTGAGAGATGGGACCTTTAAAAGG7GA77AGG7CATA 

AGGG77C7GCCCTCATAAATGAATTAATACTGT7ATCA7GAGAGTAGAT7 

CC7GA7AAAAGGATGATC7CTGCC7CCTGCCCACAGCCCTCT7GTGCA7G 

C777CC7GCC77TCCACCTTCTGC7A7GGGA7GACACAGCAAGAAGGCCC 

TCACCAAA7GCAGCTCC77GA7C77GGAC777CCAGCC7CCAAAAC7G7A 

AGCCAAACAAA7TTC7GTTTA77A7AAATTACCCAG7C7CAGGTA7TCTG 

T7C7AGAAACACAAAA7GGAC7AAGA7CA77AAA77A7CA777777A7CA 

GAC7G77GA 

>Contig27

AAAA7A7AACAGAGAG7AAGAGGAAAA77ACC77C777C77T77CC7T7C 
CC7GCC7GACC7TAT7CACC7CCCA7CCCAGAGCA7CCA777A77CCA77 
GA7C777AC7GACATC7A77A7C7GACC7ACACAA7AC7AGACA77AGGA 
CAA7G7GGCC7GCCTCCAAGAAAC7CAAA7AAGCCAAC7GAGATCAGAGA 
GGA77AA7CACCTGCCAA7GGGCACAAAGCAACAAGC7GGGAGCCAAG7C 
CCAAAA7GGGGCCTGC7GCT7CCAG77CCCCTC7C7C7GCA77GA7G7CA 
GCA77A7CC7TCGTCCCAGTCCTG7CTCCACTACCACTTTCCCCCTCAAA 
CACACACACACACAACAbJCTTAGA 

ACTCAATTTGTAAGTATATAATCCAAGACCTTCTATTCCCAAGTAGAATT 

TATGTGCCTGCCTGTGCTTTTCTACCTGGATCAAGTGATGTCTACAGAGT 

AGGGCAGTAGC7TCATTCATGAACTCATTCAACAAGCATTATTCACTGAG 

AGCCTTGTATTTTTCAGGCATAGTGCCAACAGCAGTGTGGACAGTGGTGC 

ATCAAAGCCTCTAGTCTCATAGAACTTAGTCTTCTGGAGGATATGGAAAA 

CAQACAACCCAAACAACCAACAAAAGAGCAAGATGCTGCAAAAAAAAAAA 

AAA7GAA7AGGG7GC7AAGA7AGAGAAAAG7GGGAGAG7GC7A777AGAC 

AAAG7GG7AAAAACAAAGCCCC77G7GAGA7GAGAGC7GCCGACAGGAGG 

GGGCGGGTCATGGTTGTGGGTTTTTGGGTAGGACATTCAGAGGAGGGGGC 

GGGTCGTGG7TGTGGGTTTTTGGGTAGGACATTCAGAGGAGGGGGCGGGT 

CGTGGTTGTGGGTTTTTGGGTAGGACATTCAGAGGAGGGGGCGGGTCGTG 

GTTGTGGGTTTTTGGGACATTCAAAAGAGTCTGAATGCACCCAGGCCTAC 

AACTTCAAGATGGTAAAGGACAGCTCCAAGGATCAGAAGAAGCATGCTTG 

GAACTGGGGCATTTTGAGAAGGAGGAAAAATATGCAGAGACTAGTGCTTG 

CAGAGCTTGCATGTGGATTTCATTTGAGGTACAATGAAAACGCATTAATG 

GGTTTCACACAGTGCAATGGCCTGACCTCACTTATATTTCCTAAAATAGA 

AAACAGATCAGAAGGAAGGCAATAGAGAAGCAGAAAGTCCAATGAGGAGG 

TTTCACAGCAGTCATGGGGGTGGGGTAAGGAAAAGAAGTGGAAAGAAACA 

GACAGAATTGGGTTATATTTTGGACJATAGAACCAACAGAAGGAAGAGGAG 

AAACAACATTTACTGAGAAGGGAAAAAGTAGGAGAGGAATAGGTTTGGGA 

AATAAATCCTGCTGACATTGGAAACCCCAAGGAAGCCTCAAAAGTATATT 

TACTTGCTTTAGATTTAAAAGAATAGGAAAGAAGCATCTCAACTTGGAAT 

TTGAAATCTATTTTTCCATAAAAGTATTGTTAAATTCTACTCATACTCAC 

AAGAAAAG7ACATTCTAAAGAGTATATTGAAAGAGTTTACTGATATACTT 

AGGAATTTTGTGTGTATGTGTGTGTGTGTATGCGTGTGTGTGTGTTTAAC 

CTTCAAT7GTTGACTTAAATACTGAGATAAATG7CATCTAAATGCTAAAT 

7GA777CCCAAAGG7A7GA777G77CAC77GGAGA7CAAAA7G777AGGG 

GGC77AGAA7CAC7G7AG7GC7CAGA777GA7GCAAAA7G7C77AGGCC7 

ATG77GAAGGCAGGACAGAAACAA7G777CCC7CC7ACC7GCC7GGA7AC 

AG7AAGA7AC7AG7G7CAC7GACAA7C77CA7AAC7AA777AGA7C7C7C 

7CCAA7CAAC7AAGGAAA7CAAC7C77A77AA7AGAC7GGGCCACACA7C 

7AC7AGGCA7G7AA7AAA7GC77GC7GAA7GAACAAA7GAATGAAGAGCC 

7A7AGCA7CA7G77ACAGCCA7AG7CC7AAAG7GC7G7T7C7CA7GAAGG 

CCAAA7GC7AAGGGA77GAGC77CAG7CC77777C7AACA7C77GT7C7C 

7AACAGAA77C7C77C7777C77CA7AGGAGA7GCC7GAGA7ACCCAAAA 

CCA7CACAGG7AG7GAGACCAACC7CC7C77C77C7GGGAAAC7CACGGC 

AC7AAGAAC7A777CACA7CAG77GCCCA7CCAAAC77G777A77GCCAC 

AAAGCAAGAC7AC7GGG7G7GC77GGCAGGGGGGCCACCC7C7A7CAC7G 

AC777CAGA7AC7GGAAAACCAGGCG7AGG7C7GGAG7C7CAC77G7C7C 

AC77G7GCAG7G77GACAG77CA7A7G7ACCA7G7ACA7GAAGAAGC7AA 

A7CC777AC7G77AG7CA777GC7GAGCA7GTAN7GAGCC77G7AA77C7 

AAA7GAA7G777ACAC7C777G7AAGAG7GGAACCAACAC7AACA7A7AA 

TG77G77A777AAAGAACACCC7A7A7777GCA7AG7ACCAA7CA77TTA 

A77A77A77C77CA7AACAA7TT7AGGAGGACCAGAGC7AC7GAC7ATGG 

C7ACCAAAAAGAC7C7ACCCA7A77ACAGA7GGGCAAA77AAGGCA7AAG 

AAAAC7AAGAAA7A7GCACAA7AGCAG77GAAACAAGAAGCCACAGACC7 

AGGA777CA7GA777CA777CAAC7G777GCC77C7AC7777AAG77GC7 

GA7GAAC7C77AA7CAAA7AGCA7AAG777C7GGGACC7CAG7777A7CA 

7777CAAAA7GGAGGGAA7AATACC7AAGCC77CC7GCCGCAACAGTT77 

77A7GC7AA7CAGGGAGG7CATTT7GG7AAAA7AC77C77GAAGCCGAGC 

C7CAAGA7GAAGGCAAAGCACGAAA7G77A777777AA77A77A777A7A 

7A7G7A777A7AAA7A7A777AAGA7AA77A7AA7A7AC7A7A777A7GG 

GAACCCC77CA7CC7C7GAG7G7GACCAGGCA7CC7CCACAA7AGCAGAC 

AG7G7777C7GGGA7AAG7AAG777GA777CA77AA7ACAGGGCA7777G 

G7CCAAG77G7GC77A7CCCA7AGCCAGGAAAC7C7GCA77C7AG7AC77 

GGGAGACC7G7AA7CA7A7AA7AAA7G7ACA77AA77ACC77GAGCCAG7 

AA77GG7CCGA7C777GAC7C7777GCCA77AAAC77ACC7GGGCA77C7 

7G777CA77CAA77CCACC7GCAA7CAAG7CC7ACAAGC7AAAA77AGA7 

GAAC7CAAC777GACAACCA7GAGACCAC7G77A7CAAAAC777C7777C 
TGGAA7G7AA7CAA7G1 - rC77C7AGG77C7AAAAAT7G7GA7CAGAGLA 

TAATGTTACATTATTATCAACAATAGTGATTGATAGAGTGTTATCAGTCA 

TAACTAAATAAAGCTTGCAACAAAATTCTCTGACACATAGTTATTCATTG 

CCTTAATCATTATTTTACTGCATGGTAATTAGGGACAAATGGTAAATGTT 

TACATAAATAATTGTATTTAGTGTTACTTTATAAAATCAAACCAAGATTT 

TATATTTTTTTCTCCTCTTTGTTAGCTGCCAGTATGCATAAATGGCATTA 

AGAATGATAATATTTCCGGGTTCACTTAAAGCTCACATTACACATACACA 

AAACATGTGTTCCCATCTTTATACAAACTCACACATACAGAGCTACATTA 

AAAACAACTAATAGGCCAGGCACGGTGGCTCAGACCTGTAATCCCAGCAC 

TTTGGGAGGCCAAGGTGGGAAGATCACTTGAGGTCAGGAGTTCAAGACCA 

GCCTAGGCAACATAGTGAGATCTCATCTCTACAAAAAAAAAATGAAAAAT 

TAAAAAATGAGCTGGACATGGTAGTACACACCTGTAGTCCCAGCTACTCG 

GGAGGCTTGAGGTGGGAGGATCACTTGAGCCTGGGAGATGGAGGCTGCAG 

TGAGCCATAATCACACCATTGCACCCCAACCTGGGCAACAGAGTGAGACC 

CAGTCTCAAAAGATAAATTTTTAAAAATGTTAAAAAATATATAAAAGAGA 

ATTTTAAAAGAACAACTAATAGATCAAAGCATGGATGCAAGATATATTTA 

GTTGGAAAATCAAGGTTAAAATCAAGGGATCTTGGAATTAGGTGTGGTAG 

ATTTGGGTAAGGAGTAGTCTAAGATGACCCTGTTTCTTGGTACTGGAGAC 

TGGATGAGTGGCAGCGTCTTAACCATATTTTTGGTAGAAATATGGAGGTC 

TTCTCCATTCCAGGATGAATGATGAGTAAAATTTTAGGCATGTAATTTGA 

GCTACTAGAAGGACACTCAATTGCAGATGfACAATGGGGAGATGATAACC 

TATCTGGAACTCAGAAAAATAACTGTATATAGATATGAAAGACATCAGTA 

GG7ATGTAGTAGATAAAATCCTAAAAGTGATG7CAAAGGGAGAAGAGAAG 

TATATGGTGAACACTGTTGTTTGTCCATGCAATTGCCATCTCTTCTTCTT 

CCTTACTGACAGAACCCTGATTTCACTGAGAAG7CAACATGCCCTTCCCC 

AATTGATGAATCCAATTGGTTGAAGATTATGTTCATTCTATTCTTACATG 

ACTAAGTCACGTTGACTTAATCCTATCAAATGAGATGTCGATCTGGAAAC 

AACTTCTGGAAAAGATTTTCTACCTTGATAAAATAAAGAGCCATATAGAT 

GGTCCTTTATCTTCCTTCTTCCTTGAATGAGATATGTTCTATGAGGAAGT 

GAAGCTTAGAACTGTGGTCAGCAACTTGCAACGACTGGGAAGTCAGAGCC 

ACACAATGAAGAATGCAGAGTGGAAGGAGAAAAAGAGCCAGCATCTCTGA 

CAACATTGTTACACCGAGAACCTACCTCCAGATTTTAAGAAAACAAGAAA 

TGCTACTGTTATTAAGCCATTTCACTGGGTTTGCTATGACTTGCAGTCAA 

ATCTAGCTTAACTGATACAGAGCACCACAGAGAACTGGTCTCTCATTTGT 

CTCATCCTGTTCTTTCTAGCAGCCACGACTTTCCTAGGGTTTCCTTAGCC 

CAAGTCTGGCTAGAGCAAGACTAAGTAAGACTTGATTCCTTAATGTCCT7 

77G7777AAGAAA7A77AAAGAA77A77777A7A77AA7A7A7777AAGA 

AA7AAGGAAA7ACAAAACAC7GAGCAAGCAACACAAA77CAAGAAA7C77 

AAAAAG7A7AA7AGC7GC7CAG7C7C7GA77AACAG7GAAA7A7GGAA7C 

A77G7AGAAA7GGCC77GGAGCG77A77C7CCCAGGCCAGC7A7CC77A7 

GG7C7GCCCCACC7CCC7CA77GCC7AAACAG7AAGAGAGTCCCA7GG7G 

AGAC7CAACAG7C77AGCACAGAAC77G77ACAG7C7A77TC7777C77A 

CAG7CC7A7A7A7CAATTCCAAA7CAA7GAGAG7AAAGCCCAA7CCC7GC 

C777AAACCCAAAGGACAGAAGCCCAAAGCCCAAAGA7A7TCCC7AACC7 

7C7CCCCC7 

>Contig28 

CC7G7CGC7CCC7A7G7T7AAAGC7GGGGA7C7C77777CC7G7G7C7AA 
T7A77TTCC7CA77GGC77GAAAAA7C7GA7AAAACA7777AGGAC7G7G 
7A7AAAA7AGAA77AGCCAAG7GCAA7G7C777A77CAGAAGAAA777CA 
7GGACG77G7GCC7AC7C7C77GGC77CC7GGC77CA7GGC777CCAGA7 
CCCACAG7AAGC7C7GGA7AG7AGAAG77A7AG7AAGACTGAC77C7AAA 
7AAA7GAAG7GAC777AACC77AC7GA7A7GGC77AAAGAAAAGGAG7GG 
CC777AAGA7CCA7GAACTTC7CAAACAAAAG7GA7AACGT7A7C7CCA7 
GCA7A7A7AA7AC7AAA7A7AA7GCAAC7GAGAGAAG7AGGC7G7GG7AA 
GAAAGGAGACCCAAG7GCCA7C7GAAGGCAGCAC77ACCAC7C7GC77CA 
7CCCACCGAGGAAACAAAGCA7GAG7A77GCCAGA7777C77C7G777CA 
AGAAAAGCCAGAAA7CCAGG77777GCG7GAAA7G7CC7GA7777AA7G7 
7GGGAAC7AA777A7A7777GAAA7AACA77G7G7GGGACAAG7GAAC77 
G7A7G7GGAAC7GC777C7CCCAG7GGCGACCAG777GGACCG77GA7AC 
TCAGCAAG77CAGCCAAG7GCGCC77G7CA77G7CAG7CA7CAAGG7GA7 
GTGTGATTGGTCAAACA^rTAGTTTTGCTCAGCATCTCGTGTGTTTTCAA' 

AGGACCTGAGGGTTCATTTGCCCATGCAGATCTTGTAGTCCTGTTTATTC 

TAT7AATTTATCTTGCAAATCTATAATGTTTTATTTTAAGCAGCGAGAGC 

CGTGGCAGCCTTTGGTCTGGACCCTTTCTAATGATCATTTAGTATCAGGC 

TATGTGGGAGTTGATTGTTTTGCATTGCCTGAAAGCCAACAGTATCACTC 

CTCCTCTAGGTGTGGCAGAGATGTGAGAGAGGGAGACTGACAGTCTGTGG 

Gi^GTGTATGCAGTGTTGGGGGAAGCGAGGCACAGGGGACAATACTGTGGT 

GTATAAAACTAGTCTAAGGTAGCATCAGGAAGTTCATGAAGCCAAAATGA 

TTTTCATAACAGCACAAGACATTATTTGTTTTTGCCTCCCTCTCATTTTT 

7777777777GAGACAGAG7C77GC7C7G7CA7CCA7GC7CG7G7GCAG7 

GGTGCAATCTCGGCTCACTGCAACCTCCACCTCCAGGGTTCAAGCAATTC 

TCATGCCTCAGCCTCCTGAGTAGCTGATTACAGGTCTGCACCACCCCGCC 

GGCTAGTTTTTGTATTTTTAGTAGAGATGGGGTTTTGTAATGTTGGCCAG 

GCTGCCCTGTCATTTTTTTTTTACTAGTGTCCZAGTGGAGTTTTTTAGGGG 

CTACATAACATGATACTGTCATTAATCTAATGGCTAATGAAAGGGATATG 

TATATGTTTTTGTGTTTAAAACAAACTTCTTTGGGGTCCTCAATAATTTT 

TAAGAGTATAAAGGGGTCCTGAGATCAAAGAGTTTGAGTTCTGCTGGACT 

GGGACAGTGGTTGTCAACCCAGATTGTACATTAGGGTCATCTGGGAAGCT 

TTAAAATAGTACTGATGCCCAACCTTACCGCAAACCAATTAAGCCAGAAT 

CTCTGTGGATGAGAAGTCTTCATTGTCATCATCACCATGACCATCATCAT 

TGTCACCG7CACTACACCATTATCATCATCATCATATCATCTTCATTATC 

ATTGTTAGTATCTCCATCACCATCATCAGCATCACCATTATTATCATCAT 

CATCATCCCCACCATCATCCTCATCGGAACTTCACCTGCATGGAGGACAA 

TCCACTATGCATTAGGTGCTATGCTATTTGCTATACTCCTTATTC7CACA 

ACTGCCCAGAGAGGCTGATATTATCTCACTTTATAACAGGAGGAATCTGG 

ATCGGAAAAGTTAAGGTAAGCTAATTCACAGAGCGAGAAGAGATAGAGCC 

AGGATTCGAAACCAGTTCTCTGCTACATCAATGTTCCCAGTCCTTGCACT 

ATTGAGAACCTCTTTAGTTATGCTTTCACCCCTCCAACACCACAGTAAAT 

TTTTTCTTTTTTTAAAAAAATTATACTTTAAGTTATAGGGTATATGTGCA 

TAATGTGCAGGTTTGTTACATATGTATACATGTGCCATGTTGGTGTGCTG 

CACTCATTAACTCGTCATTTACATTAGGTATATCTTCTAATGCTATCCCT 

CCCCGCTCTCCCCACCCCATGACAGGCCCTGGTGTGTGATGTTCCCCACC 

CTGTGTCCAAGTGTTCTCATTGTTCAGTTCCCACCTATGAGTGAGAACAT 

GTGGTG7TTGGTTTTCTGTCCTTGTGATAGTTTGCTCAGAATGATGGTTT 

CCAGC7TCATCCACGTCCCTACAAAGGATATGAACTCATCCTTTTTTATG 

GCTGCATAGTATTCCATGGTGTATGTGTGCCACATTTTCTTAATCCAGTC 

7ATCAT7GCTGGACATTTGGGTTGGTTCCAAGTCTTTGC7ATTGTGAATA 

G7GCCACAG7GAACA77CA7G7GCA7G7G7C777A7AGCAGCA7GA777A 

7AA7CC777GGG7A7A7ACCCAG7AA7GGGA7GGC7GGG7CAAA7GG7A7

77C7AG77C7AGA7CC7TGAGGAA77GCCACAC7G7C7ACCACAA7GG77 

GAA77AG777A7AGCCCCACCAACAG7G7AAAAGCA77CC7A77TC7CCA 

CA7CC7C7CCAGCACC7G77G777CG7GAC77777AG7GA77GCCA77C7 

AAC7GGCACCACAG7AAA77777A7AGA7777A7AAGCAAA77G7A777A 

C7G7GCAAGAA77GG777A777777AAACCA7G7G77GCAAACA7ACAA7 

GG77AA77G7GA7AT77GCTCAG7ACAAGATCATCAGATCAC7ACACAGA 

C77GAGG7AA77CCACC7AAAAGCAAAGAGAAC7GACCCCACA77AAC7G 

AGAAG7C777AC77A77TAT7CCC7A7AAACGAGCCAATATGAAGAGAAG 

GCC77AA7G7GG77AAC7A7G7AA7777777C7GAC77777GAAA7AC7G 

AGAAGAGC7CA7GAC7CTCCCATC7CC7AA7TC7ACC77GG7GGA77T7A 

GAC7GACCACAACTCA7GGG7AAA7GAGGGAAGACGAA7AAGAAACC77G 

C77777777CC7CC77G77777GGC7GGC7GCAG7GGC7CACACC7G7AA 

7C7CA7CAC777GGGAGGCCAAGG7GGGAAGA7CAC77GAGC7CAGGA77 

7CAAAAC7GGCC7GGGCAACA7AG7GAGACCCCA7C7C7AAAAAAAAAAA 

AAAAAAAAAAAAAGGCGACAGGCGG7GCG7GCC7G7AA7CC7ACC7AC7C 

AAGAAGCCGAGG7GGAAAGA7CAC77GAGCA7GGGAGG7CAAAGC7GCAG 

7GAACC77GA77GCACCACT7CA77CCAGCC7GGG7GACAAAGCAGGACG 

CTGCC7CAAGAAAACAAAAACAAAACCTTAATTTTTTGGCTATTCTTTTC 

7GG7AAGAA7GG7ATAGAGA7GGGGA7GAGGA7GGC7A77G7A7GAGAGA 

GCAAACAGGG7CCAAGCAG7GC7C7GGGC7G7C7AAGGACCAG7AG7CAG 

C77AAC77C7CAAA777CCAGGGAAGGAG77CGGAG7GG7AGAA7A7CC7 
GGGTATGCCCAAAGCATLaCCTTGCAAATAGCCTGTCATGAAT^TT 
TTCA7TTGTTATGACTGGAAACTGGCTTTGTGTATGCCAGAGAATGGGGG 
CAGGAAAGAGAGATTGGTGTCTTGAGC7CTCTGTGCCTCTGGGGCAGTGA 
7GC7777CC7C7CA7G7GGAAGGAGAGCA7GAC7GAAAAGG7GCACAAA7 
AAGGTG7CTGTGAGAGAAATTAACC7TCCAGATACAGAGACACAACCTTC 
CCCAAGAGG7CC7CA77GC7C7GCC777777CC777TT777GC77G77CT 
AQCA77AA7AACAGAAAC7GA77A7GACC7CAAAAGAGAGGAGAAAGCGA 
CTCTCCCCACCCTAGAGC7AG77AACCACCATATC77CCTAGATATCC7T 
GAGAGCAA7G7AACCC 
>Contig29

G7GAAC7CG77T7ACC7G7G7AGCAGACCAAGCCGCAGACAAAA7CCN7C 

AGACACCAAA77AAAGAAGGAAGGGC777A77GGGCC7GGAGC7GCGGCA 

AGAC7CACG7C7CCAACAACCGAGC7CCCCGAG7G7GCAA77CC7G7CCC 

7777AAGGGC7CACAAC7C7AAGGCGG7CCACA7GAGAGAG7CG7GA7AG 

A77GAGCAAGCAGGGGG7A7G7GAC7GGGGGC7GCA7GCACC7G7AG77A 

GAA7GGAACAGAACATGACAGGGA7C77CACAG7GC7777CXTA7GCAAA 

7AACCGA77AGA7CAGGGG7CGA7C777ACCAGGCCCAGGG7G7G7CACC 

GGGC7G7C7GC77G7GGA777CA777C7GCCT77TAG77A77AC77C7T7 

C777GGAGGCAGAAA77GGGCA7AAGACAA7A7GAGGGG7GG7C7CC7C7 

C77ACC7GCGGGGAG7GAGC7CAAAC7CC77AAAGGAG77ACC7GCC77C 

CA7CA7CAGGGAAGCAGGAAA7C77GCC77CC77G77GGAAGCAAG7AAA 

AC7CAAAACAAACAAAGAAAAAAACAGGGAG77G7ACAGCAAAA7AAAC7 

T77GA7TT7GACCAAA7777GGGAGA7CAGGAA77C7C7GAAGGAGA7GC 

777CAGACC7CAGCAAA77G7CC7G77GG777GAGCCA7AAAG77AGC7C 

A7GC7GG7ACCAAACACCAG7AGGAGA777G7CAAAGG7AAGAGGCA7C7 

CCAC7CAGAA7CCCT7CG7GG77ACCAACA7G7GAACC77GGAAA7C7GA 

GACAGG7C7CAG77AA777AGAAAG777A7777GCCACGG77GAGGACAC 

CCACCCA7GACAGAGCA7CAGGAGG7CC7GACCACA7G7GC7CAGGG7GG 

7C7GAGCACAGC77GGT777ACACA7777AGGGAGACA7GAGACA7CAG7 

GAA7A7A7G7AAGA7G7ACAC7GG77CCC7CCAGAAAGGCAGAACAACTT 

GAAGCAGGGAGGGAGC77CCAGG7CACAGG7AGG7GAGAGACAAACAAIT 

GCA77C77C7GAG7G7C7GA77AGCC777CCAAAGGAGGCAA7CAGA7A7 

GCA777A7CACAG7GAGCAGAGGGG7GAC777GAA7AGAA7GGGAGGCAG 

G777GCCC7AAGCAG77CCCAGC77GAC7777CCC777AGC77AG7GA77 

7GGAGGCCCCAAGA777ATT77CC77C7ACA7CAC7G7GGGCAGC7GAC7 

AGGAAAGC777G7AGGAC7GG7GGGCAG7G7GAGAGCCCAG7GGGGGG7G 

G7GG7CCTG7GCCAA7GG7AGCAACCACC7G7GAGGC7GAG7AAAC7CA7 

T7CCCAACC7CC7C7AGCAGCCCCAG7GGAGA7ACAGAGGAAGCAGAC7A 

GCGA7ACAACCCAGCC7GAAG7777G7C7GG7GAG7G7AA7GGAA7AAAA 

A7GGGAAGGG7GC7GAAGAGACCAGCAAGAAAA7GG77GAAGAGA7GGGG 

CACAGAAA77AAGC7GGA7CAAAAAGGACGGAAAAGCAGAAAGGGCCGA7 

AGAGAGAGGGGA7A7C7A7GGG77CGCGA77C7GAAAAGGACAAA7CAC7 

GG7GC777GAGAAGAGAGAGGG7GAGAAAGCAGGAAGGC7GGAGGCTGTC 

A7CCAAGAGGCGGACA7C7G7GAACA7GA77CCAAGAG7CACCAGACCA7 

GGGGG7GGCCAAAGGGAG7GCC7C77C7CAqC7CC7AC7C77AA77CC77 

G7AC7CAAGA7AA7AAG77CCCAGAAGAGAAG7ACCCA7A777AA77CA7 

C7G7G7C77CC7AGCAG7AC7AAAAA7A77A7A7GAAAGG7A7CAAACC7 

77GAGAA7G7G7GC7GC7AAA77G77AAGGA7GC7GGAAAAC7CAAGACG 

7CCC7GA7CC7GAGCC7GAG7A7GAGCC7G7GG7GAGCCCAA7GCAGG7C 

7CCA77CAGACAAAGGCC7CAGGGAACGGA7GAGACC7AGGGACAGAGA7 

GCA7GC7GGAGCAGCA77CCCCA7CCC7AC7GCAGC7CAGGCCAGC7GAC 

TGC77TA7GAG7AAACG77ACCAGGGAACAC777GCAG7C77AACACACA 

7GCCCACC7G7GACCAC7GA7CCC7G77GGG7GACCAC7GACA7CAGAGA 

77CGA7GGCAGCAA7GAAGACAAGGC7A7CC7CA77AGGAAGGAAAGGAA 

GGAGGAGGGAGGAGGGCAAACGAA7C777CC7GC77G7CAACCACG7CCA 

7C7C7G77AGG7GA777CCCA7G7G7GAC777G777A7C777A7AA7AAC 

TC7GAGAGG7AGG7C77GA7G7CCACA7777GAACA7GAGGACA7CCAGC 

CAGGAAG77GAG77C7GGGGACA7AGC7GAGAGGGCAAAGC7ACA7A7AA 

ACCCC7C7TTG7TTTTTCTGGC7TATCCACTGAGTGCCCCCTGCAATCCA 

CCAGCCCA7TTG7GAAGTGCATACTATAGGTAAGTTGGCACAGGAGGAGT 
GAGGGCCAGG7A7G77CC7GACCAG7CACAA7CAAAGCAACC7CC7AC7A

A7CAGGGAGGC7TGG7ACC7GGGGAA7GC7A7G77GAAAGG77C7777C7 

GGGTTTTAAAATGATGGGTCTATTTCCTTATTCTTAAGATTGCTTTTTTT 

C7GGC7AGAAC77AAAAGAAA7777CAG7AAAA777CCC77CCC7GGCAC 

AAAGTGAGCTTGAAATGAATTCCCAGGTGGCCTTGATACTTTAAAATATT 

GCC7CC7A7AAAA7CAACC777AGAAGAAGGAAG7CAAAGAACA7GC7AG 

ATTTCACAAAGGTTAATTCCTTGAAATCCAGTTATCTACAGGACAATGTT 

GTCAAAGAAAAAATTATTTGGCCAGGCACGGCGGCTCATGCCTATAATCC 

CAGCACTTTGGGAGGCTGAGGCAGGTGATCACCTGAGGTCAGGAGTTCGA 

GACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAA 

AAAATTAGCCAGGTGTGGTGGTGGGCACCTGTAATCCCAGCTACACGGGA 

GGC7GAGGCAGGAGAATCGCTTGAACCCGGGAGGAGGAAGTTGCAGTGAG 

CCAAGT7CAAGCCACTGCACCCCAGCCTGGGCAACAGAGCAAGACTTTGT 

CTCCAAAAAAAAAAAAAATTCAATGATATTTTTAAATTCATGGTAAGGAA 

GATTTCATTCAGAACCAGCACAGAAGATATAGGAAACACTGCAATGGGAC 

TTTGCGGTGGGGGAGAGAGATTGAACACAACTACATATACAGCACGGGCA 

AGGACATATTCATAGCCAGGAAGCAGAGCAAAGATCAGTGGATGCGAAAT 

TACTAAGAGGAAACATGAAAAATAAGGGAGCTTCTGCCTAAACCCACCTA 

ACCGGATCCTTGCTGAAGACAGGACAGGGTGATTGGACACCACTTTGGGG 

ATGGTGGAGGATGGGGAATCCAGTGSGATTTCAAGGGTGATGCGATATTG 

AACATACAAAGTTCTTGCTAAAAAAGGATTTTACAAGAAAGTGTACAAAT 

GTGCCTGGGACAAGG7GCAGGAGCCCGACGGAGATGTGG7CCAGCAGAGA 

A7A7G7GCCGAGA7GA7AGG7GAG77C7C7GACGAAGGA7A7A7GCTGA7 

CCAGCCAGGG7GAAA7GC7CAGAGAAAGCACGGAGGGGC7A7G7CCG77G 

CCCCAG7C7CCACGCGG7CAAA7C7GA7CCCG77G7GAG7G7GGCCG777 

G7AGAAAGCAA7CAGGGGGGG7CCC7CCCC 

>Contig30

AATA7A7A7777T7A7ANNATN7GAGACAGG77C7CAC7AGG77GCCCAG 
GCTGG7C77GAAT7CC7GCCTTCAAG7GAC7C7CCCACC77AGCC7AC7G 
CATAGC7GGGA7TACAGGCACAAACCAC7GCA7GCAGC7AAC777GC77C 
7CA77CCAGCACT77T7A7TCCAC7GA77A7A7G7A7A7G7A7A7C7GCA 
TCA7C7C7C7C7C7C7C7C7C7C7C7C7C7C7C7A7ATA7A7A7A7A7A7 
A7GGAAA7A7C7C7C7C7CTC7C7A7A7A7A7A7A7GGAAA7A7A7A7C7 
CAG7C7C7CC7A7CC7CC7T7AA7CAG7777GC7A7CC7G7CAA77CCCC 
CAACGAG7G7GA7G77G7GAAA7A7A7A7T7G77C77CA7C7CC7G7TTC 
C7GACA7ACAGC7777AAAAACCC77GGAA7C7C7GGAA7AA7AAGAG7G 
TC7777GCA7GC7AA7AGA7GAC7GC7GGC7GGCAGCCCCAA7GCAG7AG 
CT7CA7GA7GGGGTT7GTCACAGGAAAGACCAAGGCAGGATTGGAGACTT 
GAGAC7G77AGCCCCAC7CCCCAACCAC7GGAGGGAG7GGAGGGGC7GAA 
GG77G7G7CAG7CACCAA7GGCCAA7GG77CGG7CAA7CA7G7GTA7G7A 
A7AAAGCCAC7C77AAAAACCCAAAAAGGACAGGGT7TGGAAGGGC7CCC 
AGA7AGC7GGACACA7GAAGGTTCC7GGAGGG7GGTGCCCCAGAGGGGCA 
7GGAAGC7CCACACCCC77C7CACA7GC777GC7C7GCGCA7C7C77CA7 
C7GG7G77CA7CTG7A7CCTTTG7AA7A7C777TAGAATAAACTGGTAAA 
C77AAG7G7777CC7GAGTTC7GTGAGCTGC7C7AGCAAAT7CACGGAAC 
CCGAGGGAAGCAAACCCAGATT7A7AGCCATCAGTCAGAAGCATAGG7GA 
CAACC7ACCAC77G7AAC7GGCACC7GAAG7GGGAGGCAG7C77G7GAGA 
C7GAGCCC7CAACC7G7GGGA7C7AACGC7AAC7CCAGG7AGATAGTG77 
GGAG7GAA77AGGACACCCAAC7GGTG7CGGC7GCTGGAGGACTAG7GG7 
GGGAGAAA7CCCCAAGCA7T7CGG7GAC7AGAGG7CACAGAAGAAC7CAG 
7G77GAGG7G77G7GACAGTA7GG7AGGGAAAAC7GCG7C7GGT77777C 
C7777ACAA7CAG77AAA7AT77AACACAAG7C7AC7G7A7A7TAG7AAA 
AGGG77ACA777777AA7G7C77GACAG77GCAC777GACAAC77CCA7A 
7CAA7CAC7777T77CG7GTCCG777GGAACCAAAA7CAC77GGGA7ACC 
A7GAACCAGGC7GCAGCGTAT7CCCCAGGCC77GAAAGC77GGAGGCCA7 
7TTGCCAGCCN7AATCCC7G7GAA7ACCAGGC7TCG7GGA77TAAAAAA7 
AGAC77GAGGCCAGGCC7GGTGGC7CACACC7G7AAGCCCAGCAC777GG 
GAGGCAGAGGCGGA7AGA7CACAAGG77AGGAG77CGAGACCAGCG7GGC 
CAACA7GG7GAAACCCCG7C7C7AC7AAA7A7ACAAAAAAAAA77AGCCG 
GGCG7GA7G77ACACG^ JAGTAGTGCCAGATACTCAGGAGGCTTC^GGC^G

GAGAAA7AC77GAACC7GGGAGGCAGAGG77GAAA7GAG7CAAGA7CG7G 

CCACTGCACTCCAGCTTGGGCGACAGAGTGAGACTCAGT7TTCAGGGGAG 

TTAAAACAATACAAAAAAAGAAAAAGACTTGAACAATGAGGCTCCACTGG 

ATGGATTTAGGGGAATTACAGGAAGCAGGACCTGACGGTGCAATGCCACA 

CTCCACCTGTCCAGAATTGGACCTCACCAAGGGAGGTCTGTGGGGACAGG 

GAGAGGCCCTCTGCCTCCACCCCCTCCTCTACTCCCCAAACCCTGAGTCA 

G6CTGAATGTAGTAAACCTGGAACAGAAAAGTTCAGTTTGGCAATAGGTA 

TCTGAAGGACTCCAGGTGCTTCTCCCTTGATTCAAAATTTTACTTATAAA 

AAAAATTATAAGAAAATTCTACTTAAAAGAAATAATCAGGGAGGTACAAC 

AAATTGTACTTTTTTTTTTTTTTTTTTTTTTTTGAAATGGAGTCTCACTG 

T7GCCCATGCTGGAGTACAGTAGTGTGATCTCGGCTCACTGCAACCTCCG 

CCTCCTAGGTTCAAGTGATTTTCCTACTTCAGCCTCCCAAGTAGCTGCGA 

TTACAGGTGTGTGCCACCACACCCGGCTAATTTTTGTATTTTTGGTAGAG 

ACGGGG7TTCACCATGTTAACCAAGATGGTCTCGAACTCCTGACCTCAGG 

7GACCCACCTGCCTCAGACTCCCAAAGTGTTGGGATTACAGGGGTGAGCC 

ACTAAGCCCAGCCATTGTACATATTTTGTGGGTATTTACTAAAACATTAT 

TCAAAATAGTAAAAAAAAATTGAAATAAACTGGGGACTGGTTAAATAATT 

TTGGGTACAACCACATGATGGAATACTATACAGCCATTAAAAATTACATT 

GAGGCCAGG7GTGGTGGCTCATGCTTGTAATC7TAGCACTTTGGGAGGCC 

AAAG7GGGAGGAT-FGC77GGACCCAGGAGC7CAAGAC CAGC77GGGCAA7 

G7GGCAAAACCC7G7C7C7AAAAAAAAAA7ACAAAAAAAA77AAAAAGC7 

GGG7G7GGAGGCACACACC7C7AG7CCCAGC7AC7CAAAGGGC7AAGG7G 

GGAAGA7CAC77GAACCGGGGAGG7CAAGGC7GCAG7GACCCAAAA7CGG 

G7CA77GCAC7CCAGCC7GGGCAACAAAGCAAGACCC7G7C7CAAAAAAA 

AAAAAAA7ACA77GAAGAA7A7C77ACGG7A7GGA7AAA7A77CA7777A 

CAG7GA7AGA7GCAAA7AAAAGCAAA77ACAAAA7A7ACAG777AA77CC 

AAC777GA7AC7ACA7A7G7A7A7A7GAA7ACA7GCA7A7G77A7G7A7G 

7A7A7G7AAA7A7AACAA7A7A7G77C7A7A7ATGGA7A77A7A7AT77A 

CACA7ACATACACACA7A7A7AA7A7C77C7C7AGAGAGCAGAAAGAGAG 

7AGACAGA7AA7GAAGA7AGGA7ACAAC7CCAG7CCAGC7CAACC7AGGG 

GAC77G7777AAAGCC7CAGGAGAGAGAAG77GGGAC7AGAAAGCAAGGC 

AGCTA777G7AAGCA7C777G7G777CA7GC7A77GGGG7GGGAAACAAC 

AGCACAAC7777GAAAGCCCC777C7AC7CACCCCACAAAC7GCAGAGCA 

GC777AGGACCC7CAGAG77CAAGAAGACCA777GCAGAG7AGAAGAAG7 

AAAAACA7G7A7GAAC77GACCC7GAGC7CA7GGAC7G7GCCA7GAGGGA 

AA77CC7AAAACAGCAGGAGAGGCCC7GGAGGAAGGCAGAGGCCC7GCA7 

CAGCAAG7CCAGGCAAAAGCC7GCA77CCA7AGA7GC7CA7C7C7C7GGC 

7GG7GAGG7C7AAAGACG777GG7C7CAA7A77AAG7C7CG7GAGAGAGG 

7CACAAACCCAG7CCCT7GGCCACAAAAGGAAA7AAA77C7GGC77GAGA 

CATTAGGGAGGAACAGGGCAAGGGGAGG7TCAAGAAAGTTTTAATGGATG 

AGA7GA7A7T7AAGCAAGGCCC7GGAAAA7GAGAAT77CAACCAA7AGCC 

A7A7GG7AGGTCAGAAAGCAAAGATAAGGAGGGGGCAAG7GCAAGGGGCA 

ACA7CAGA7A7GACCAGGGTGTCGTGGGGCATGGCTGA7GGAGAAGAAGA 

77AGAC7GGAG7TTGGGAATGCCACAG7A7CGAGGTTGGA7T7AA7CC7A 

7GGG7AA7AAAGCCAACTGTTCAACCCCCAACCCAC77GCAATA7GGCTC 

CAAAA7AGCAGG7G777GATAAAA7GAC7AC7777AC7C7AC7A77CCC7 

CCC7C77AAGAAGAAAAAGAAAG7GGAGGC7CAGAGAAAGGCAG7GGC77 

G7CCCAA7CACACTA7GA7T7GGCCACAAAACAAGAACGAAA7G77ACAC 

CCAAAAA7GC7GCCTCCACC7CCC7TCC77GCTTTCCTCCCTGC7GGAC7 

ACAGAC7A7C7CAAGAG7GACG7ACACCA7CAGGGC7TCAGC777TCCCC 

GAAACAA7GCCAAAATA77AGCCA7ACG7CAC7G7AG7AAGAGCCC7GAA 

77GGGAA7CCCAGC7T7GACGCAGACA7GC7GA77GAC7C7G7GACCA77 

CTC77CAC77C7CCAC7C7A7TC77CCCCACC7G7AAAG7GAGG7CC777 

CCAG77A7AAAAACAGA7GA7GC7A77G7CC7G7T77G7A7C7AA7C77G 

C7G7G77A7AAAAAAAAAA7AAGGC7C7G7ACA77CA7C77GGCCAA77C 

CC77C77A7C7C7ACT7CCCACAGCCCC77777C7ACAGAAAACCAGCA7 

7G77C77C7GGA7CCA7C7C77AAGAAAGCGC777GCC7CCCCGG77A77 

7AGG7GA7AAGAAG7G7CC7AGA7GACAGCCC7GGAA7GGGC7GGAGGCA 

ACAAAAAAGCAAG7GAAA7AGACAG77ACAGCGACGACAA7AA7AACAAC 
CAACACC7C7CACTAAAGAGAAAGAAA7AAAAAAGAAAA77Aji^^

CGCAATGCCCACACAGTCATTGAATAACTGCATGTGTACAGCACTTGGTT 

ACTTTTACATACTTCATATTTTAGCCTTCATAGCAGCTCACAGGGGTGGA 

TTTAATTTTTAGTCCAACTCCTGTCACGGTGCCTGGCACAAGTATAATAA 

ATGTTCTG7GAATAAATGACCCTCTTTTTAGATGAGGAAATCGAGGCTCA 

AGGAGAACAAGCAATGTAATGTCCCCCTCCTGTTCAGCCATCTGCCTTTC 

ACGCCAC7GAA7GCAG7AG7CC7CAG7GCCC7GAAC77GACCC7C77C7G 

CTTTTCGGACTGGTCCTTCTAATCCCGTTGTGACTCACTACACCACCTCT 

CCTGCATATGACATCTACATTTTAAAACAAACCGTATGGAAATAACACAT 

TAGTCGGCTTG7TCCCCCACCCCCGCAAAAAAAAAGGCCTCTTTATAACA 

GAAACTTCTCAGGCTGGTAGGGGAATTTTATTCCCCCATTTATGGTAGAA 

AGGCCCTAACCTTGGACCTCACGCCATAGCTATTCACATGGGGGAATGAT 

GAATAACATGGGGAGCAGCATGTAAATATCATTGAGCCGTAGTCCAGACC 

TATAACACATC 

>Contig31

GGGGGAGCTGCATGTGCCTGTCGAGA7CTGGGGGAGGAACAGGAAGATCA 
AGAGTTCTGTGTAGGACATGTTAAGTTGAAGGTGCTTACAGGATAGCCAG 
ATGAAGCATCAGGTGTGCAGTCAAAGATATGAGTCTGGAGCAGCACATCC 
TAAGTCACCTCCTGCACCAACACAGAACTTCCAGGCCACTCACTTGAGCT 
CTCCCAAATAG7TTCCAAGTGTCATTATGTTAATAACCTATGAGCTTGAA 
3 CACCAGA77CAAACCCCAC7GCA7GGC7777AAAGACCA7C7CAAGGGC7 
TGACAC7CCAGGGAGCCAAC7AAAGA7GCC7GG7CC7ACCA7CAACC7CC 
ACCCCA7T7TT7ATAGAAAATGTTTCTACCTGTCC7AAGGCAGGGTCCTG 
CCCCAC7CCCAGGCCCC7TTAGATCCCCAA7A77CC7CC7CCC7GAACCA 
AAACCC7CATCATCTTCCAGCATGGGTGGGGCCTCCATTCTTGCTTCTGC 
7CCCC7GAGCAGAAGCAAG777C7CCCAAC77GACC7GA77C7CC7CC7A 
AG7ACCAG7CAC7GC777G777C7GGAA7GAGAGAAAAAGACAGAG7GAG 
AGAGACAA7CCAGAAC7C77GC7CAC7CACAGC7AGGC7GGGCA7C7GGG 
AGGA7GGC7G7GTCCA7GGGAACC7GGGAAAAGCCACACCC77GGCACCC 
7GG7CACCCACC7G7C7CCC7GGCAGAT7CCGCAC7GC7C7C77GCACCC 
7C7ACCAGGGC7AACCGGCC7GC7CAC7C7CCCCAGCA7G7C77CCCACG 
CCCAC7C7C7AA77A77ACA77CCC77CACA7AAAC7GCCC77C7C7CCC 
AATCACCACA7G7TCAC77CCCACCCAGC7G7CAAAG7C7GGC7CAACC7 
CA77C77GAAAAGGAAAAAACAAACAAACAAACAAACAAACAAGCAAAAA 
ACC7A7GA7GGA77AAGAACACAC77CA77CCAGGAACA7GC77A7C7CC 
7C7AAC7C7CACAACAAC7ACAGCAGG7AGG7G77A7CACACCCA7C7C7 
CAGG7GAGAAAACAGGCTCAACGAG7GCAGGAGGACACAGCAAG7CAG7G 
ACAAAGC77AAA77CAAGCCCAAGCC7G77GGCAACCAACG7C7G7ACCC 
T7GA7AGC7ACC7CA777ACCACCAAA7CCAG7GGCC7CAGGCC7GGC7G 
CACAC7GGGA7CACC7GG7GCCCAGACCACA7C77AGACCAG7CA7ACAG 
AA7C7C77GGGC7GGGA7CC7CCACGG7ACA7777AAGGG7CCCCAGG7G 
AG77CCACCA7GGACCCAGAA77GAGGACCCAA7ACCG7A7ACCA7C7CC 
77C77CA7C7C77C7AAGGCA7C7C77AC7CGC7G7GCAC7CCCA7ACCA 
C777G77CAA7CATCCAA7CA77CA77CA77GAG7CAG77AG7CAGGAGC 
7AC7CAC7AG7CCCC7GCCAGG7CC7AG7CATGACA7AGGGC7C7GGGGA 
CCAACAAGAAGCAGGACCCA7GCC7CC7GC7C7CA7GGAGC77GC7C7GC 
AGCAGAGGAAGCAG7CAG7GAGA7G7AGCAAATG7GAAA7G7GCACAGA7 
GGGAAAAGCAAAACT77AAAAC7777AGGACAAAA7ACACAAGAAA7C77 
7GCAAC777GGGACAGGAAGGAACAACA77CC77ACACA7GACACCAAAG 
GAA7CAACCA7AAA7AAAAAGG7GA7CAA777GACC7CA777AAG7G77A 
AGC777777CA77GAGAGACACCA77AAAAA77AAAAA7ACA7GCCACAA 
AC7GGGA7ACAA7A777ACAACAC77A7G7C7CACAAAGGA77AG7777C 
AGAA7A7A7AAAGAAC7CCCGGCCGGG7A7GGCCGCGCACGC7GGAA7C7 
CAGCAC777GGGAGGCCAGCGGA7CACA7GAGG7CAGGAG77CAAGACCA 
GCC7GGCCAACA7GGCAAAAC7CCG7C7C7AC7AAAAA7ACAAAAA77AG 
CCAGGCA7GG7GGCGGGCGCC7G7AA7CCCAGC7AC7CAGGAAAC7GAGG 
CAGGAGAA7CAC7TGAGCCCAGAAAACAGAAG77GCAG7GAGC7GAGC7C 
ACA7CACTG7AAGCC7CGG7GACAGAG7AAGAC7G7CAAAAAAACGAAAA 
CAAAAACAAAAACTCC7ACAAA7AAA7AAGAAAAAAA7AGCCCAGCAGGA 
AAAAG7A7A7ACAT77CA7AAAAGAA7AAA7ACA77C7G7CAG7777C7A 
ACA7ATATT7T7TAAGAGTAAATACAAAT

ATGC C CAAC C7 CATTAAAAATTATAGAAGTGAAAATTAAGCCACAATAAG 

ATACGA7TTTATACCAAATACAGTGTCAACACTTTGCAAGTCTGACCTCA 

CCAAG7C-77ACCAGACGTGTGCACTGACGTGGCTGCTGAGATACTGATGG 

7GGG7C-7G7AAATCTGTACTACAAACAAT7GCAATAAAATG7AATAAATA 

7ACAA7AGG7GGAGCAGGAAGTGACCTGCAACCATATAGCAGATAGGGCA 

GGAAAAAGCC7ATGAAAGCTGACATCAAAGGGATAAGTTCCAGTTACCCA 

GCTGAAGGGAAGGAGGGTGTTTCAGATAGAGGAAGGATAAGCATGACC7A 

77CAAGGCCAG7GAAAGAAGCGTGCAACGGCCAAGTCAGGAGAACCTGAA 

ATTG7G7CAAAGAGCTTGGATGCAAAGAGCCGTGGGAGACTATTGGGGGT 

7TTAAGCAGGGATATAATATTCATTCAAGCA7GCAGTAAAAGG7CACTGG 

CACCTGCCATGGGCCAGGACTCGGGCTCTACATGATTGCGTCTGTTTTGG 

AAATA7CACCCTGGCTGTGAGATGAAGAACAGGTAGGAGGGTCACAAAAC 

7TGAAGCAGAGAGACTGTTGAGGAAGTAAGCTGTTTTTGTGTGGACTGTG 

GCAATCACAGAGGCAGAGGATATAAATGCACAGAGACACAAGGCATGTGG 

GAGGCAGAAGGAATCAAATACAATGAGTGATCAGATG7GGGGTTAGAGTG 

GTGAG7GAGAAGACATACTCAAGGTGACACGCCCAGGTATCTGGGTGGAT 

GGTAAGACATTCATGGACTAGGATCGAGGAANGAGGTGGGGAATGGGACC 

ATACC7GCAGTT7ATAAGGGGTGGACGAGGGAAGATTATGCGGGAGACTG 

AGAGAGGAA7AGACAAAGGAATCCCGGTGCAGTAT7ACAGAAAC7GGGGT

GGGAGGGGG77GTANTTCAAAAAGGAAAGAAAA77GTCAAA7AGTATGAA 

ATGC7GCAGAGAAACTCACGGATTTTTTTTTTAAGCT7AGAAT7ATTCAT 

7GAC7A7G7GAA7AAGAA7AAC7777A7GAAAGAAG7777GC77AAG7AG 

7AGGAAGAAGCAAAATTG7TGAGGGCTGA7GAG7GGGAGGAGAAGTAAT7 

GAAGGCAC7C777CAAGAGAAACAAAGCAGAAGG7GAGGAGAATACTAAT 

GAAGGAG77ACGGCCTTCACTATTTTGTTTTGCTTTAGATAAGCAAGACT 

TGAG7GGG7C7GGTGAGGAGAAACAAGTAGAG7ACAAAGTTAAAGGAGAG 

ACAGACAGAGATAGAGATAGGGACAGAGAGAGAGACAGAGACAGAGCACA 

AAAGAGCAAGGTCCCTGAGAACACGGGCCTTCTGTTTAAACCCCAGCCAG 

ATGTATTGCAATTCAATTCCAGTACTAACCACCCAGAGTTTGTGTAGACT 

CTACAAGTTAAAGAGCATGGTCCCCAACAAGACTGCTTCTACGTCAGATG 

CCAGGCACACTTCAGGGGTCCCCAAGCCACTCATGTTTTTTGAATGACTG 

CCATAAG7TCAAAAATTCCCACAATTCTCTCAGATTCAATAACTGGGTAT 

AACCAC7CA7AGAACTCAAGAAAATGC7ATCATTATTATTACAATTTTAT 

7ATAAAGGA7ACAAATCAGAAGGACTAGCCAAATGAGGAGACACATAGAG 

AGAGGAC7AG7AAAAAACAGAGCTTCTGCGTCCTACCTTCAAGGAATCAG 

GATGCACCACCC7CCCAGCACATCAAGTGCTCA7CAACCAGGAAGTTCCT 

C7GAGC7CCAA7GTCCAGAGATTT7AGGGAGGA7TCATTACA7AGG7ATC 

A77GA77AAA7CATTGGCCATGTACTTGAAC7CAATCTCCAGTGTCCCTC 

77C7CCC7AGAGGTCTGAAGGGTTGGC7AATATCATGTGGC7CAAAGCCC 

CAAC7C7AATTACCTTTTTGGTCTTTTCAGGGACTAGACCCCATCCTGAA 

GCTA7C7ACAGGCCCTGCCATGAGTTAGCTCATTAACATAACAAAGACAC 

TTATATTACTCAGAAAATTCCAACAGTTTTAGAAGCTCCATGTCAGGAAC 

CTGGGACATAGATCAAATTCTTTTTTTTTTTT7TTTTTTGGAGACAGGGT 

CTTGCTG7GTTGCCCAGGCTAGAGTGCAAGGACAGATCACAGCTCAATGC 

AGC7TCAACTTCCCAGGCTTAAGTGACCTTTCCACCTTAACCTTCCAAGT 

ATC7GGGACCACAGAAAATGGCTAATTATCCTGGCTGATTTTTAAACTTT 

7TT77777G7AGGGATGGGATCGCCCTGTGTTGCCAAGGTTGGTCTCAAA 

CTCC7GGG7TCAAGCAATCATTCTGCCCTGGCCTCTGTGATGGTTAATAC 

7GAG7G7CAACTTGATTGGATTGAAGGATACAAAGTATTATTTTTGGGTG 

7GTCTG7GAGGGTGTTGCCAAAGGAGATTACATTTGAGTCAGTGGACTGG 

GAAAG7CCACCCTTTCCCAGTGGACTGGGAGACCCACCCTCAATCCAGGT 

AAACACAATC7AATCAGCTGCCAGTGTGGTCAGAATAAAAGGAGGCAGAA 

GAACAGGGAAACACTAGACTGGCTTAGTCTTCCAGCCTACATCTTTCTCT 

CATGC7GAATGCTTCCTACCCTCGAACATCAGCCTCCAAGTTCTTCAGTT 

7TTGGAC7C7TGGACCTTCAACCACAGATTGAAGACTGCAGTGTTGGCTT 

CCC7G77777GAGGTTTTGGGACTCAGACTGGCTTCCTTGCTCCTCAGCT 

7GCAGA7GGCCAATTGTGGGACTTTAACTTGTGATCATG7GAGTCAATAT 

7CC77AA7AAAC7CAGATATATATATATG7A7CAGACATA7ATATATATC 

C7A77G7A7A77ATATACAGATATATAA7ATCC7ATTATATACAGATATA 
TAATATCCTATTATATaOAGGTATATATATATATATGTATCATATATAT^
TATCCTATTGGTTCTATCCCTCTTGAGAATCCTGACTAATACAGCCTCCC 
AAAA7GC7GAGA77ACAGGAG7GAGCCACAGCCACCA7GCCCAGCCCCAA 
ATTCTTAATTATACAACAATGGGTCCAGAGATCAGGGCCTGGGTAGGATG 
CAGCAATAAGAAAACAGATGGTGGATGGGGACACATGTTGGAAGTGTGGC 
AGGACATGGCTGAGGGAACTCATAGGATGGTGTCTATTTTCATGGCTGAG 
7G7GAGGAACAGCA7AAGG7CAAAA777CAGG7CAA7GG7GAG777777A 
AA77G77GC7G7GAACCCCAAAAA7C7GACCCAGG7C7CAG77AA777AG 
AAAGTCTATTTTTCCAAGGTTGAGAACACCCACCCACTCACGACAAGAGC 
ATCAGGAGGTCCTGACCACATGTGCCCAAGGTGGTAAGAGCACAGCTTGG 
TTTTATATATTTTAGGGAGACGTAAGTCATCAATCAATATATGTAAGATG 
TACACTGGTTCTGCCTAGAAAGGCAGGACAACTTGAAGCAGGGAGGGGGC 
TTCCATGTCACAGGTAGGTGAGAGACAAACAGTTGCATTCTTTGAG7TTC 
TGATTATCCTTTCCAAAGGAGGCAATCAGATGTGCAATTATCTCAGTGAG 
CAGAGGGATGACTTTGAATAGAAAGACAGGCAGGTTTGCCCTAAGAAGTT 
CCCAGCTTGACTTTTTCCTTTAGCTTTGTGATTTGGAGGCGCCAAGATTT 
ATTTTCCTTTCACATTTCCCCCCCTTTCTTTTTAAGAATCTTTTAAAGAA 
AGCTTTTAAAAAGAAAATGAGTCTCTGGTCCCAGGTTTCATCTGAATTCT 
CGAGGGGAGGATGG7T7ATCCTAAACGGGTGGTTCTGAATTTTGAGAAAG 
7GCATTGTAC

>Contig32 

AAAAGC CA7ACGAA7GAGGAAGAA77AAGGGCCAGAACAAAACAAGAAGA 
TGAGGGAAAG777GGAAC77C77AGAGAC7GGC7AAA7GG77G7GACCAA 
AA7GC7GA7AG7GA7ACGGACAA7GAAG7CCAGGG7GACAAAG7C7CAGA 
7GGAAA7GGGGAA777G77GGGAAC7GGGCAAAGG7CACCC77GC7A7GA 
C7CAGCAAAGAAA77GGG7GCA77G7G77CA7G7CC7GGGGA7C7G7GGA 
AG777GAA7G7AAGAG7GA7GAC77ACGG7AGGG7A7C7AG7GGAAGAAA 
CC7C7AAGCAACAAAG7G7G77GC77AGAAA777C777C777C7777777 
777777777GAGC7GGAG7777GC7G7G7CGCCCAGGC7GGAGCGCAG7G 
GCGCAA7C77GGC7CAC77CAAGC7C7G7C7CC7GGG77CA7GCCA77C7 
CC7GCC7CAGCC7CCCAAG7AGC7GGGAC7ACAGGCGCC7GCCACCA7AC 
C7GGC7AAT77777AG7A7777AG7AGAGACGAGG777CACCA7G77AGC 
CAAGA7GG7C7CAA7C77C7GACC7CG7GA7CCACCCGCC77GGCC7CCC 
AAAA7GC7GGGG77ACAAGCA7GAGCCACCCCGCC7GGCC7GC77AGAAA 
777C7AAGCCAGGA7A7GGCC7G7C7GC77C7AACAGCC7G7GC7CAGGG 
G7AAAGAAA7GAC77AAAG77GGAACC7A7G777AAAA7GGAAG7AGAG7 
C7AAAAA777GGAAAA777GCAGCC7GGCC77G7GGCAGAGAAAGAA7CC 
AAG7AGGC7GCAGAGCAA7CA77GC7AGAGAGA77AGCA7GAC7AAAAGG 
3AGCCAAG7GC7AA7A77CAAGACAA7G77AAAAAGGCC77GAGGGCA77 
TCAGAGA7C7A7GAAGCAGCCCC7CCCA7CACAGG7GCAGAGG777GG7G 
CAC7AGGCCCAGAGG7777A7GGGCCANNGCCAGGGCCACAC7GC7A7GC 
ACAGC777GGGACAC7GC7GCCCGCA7CCAGGCCAC7C7GC7C7GGC7CC 
ACCC77GGC7CAAACGGGCCAAGA7AGAGC77GGACCAC7GC7CCCGAGG 
GCACAAGCCA7AAGCC77GG7GG777CCA7G7GG7G77AAGCC7GCAGG7 
GCCCAGAA7GCAAGA77GAGGGAGC77GGGCAC77CCACC7AAA777CAG 
AGGA7G7G7CAGAAACCC7AGGTTCCCAGGCAGAAGCA7GA7ACAGGGGC 
AGAGCC C77GCAGAGAACC7C7AC7AGGGCAA7GCCAAAGGAAAA7G7GG 
GG77GGAG7CC7CACACA7GG7CCCCAC7GGGGCAC7ACC7GG7GA7AC7 
G7GGGAA7GGGGC7GC7GCCC7CCAGACCCCAGAA7GG7AGA7GCAC7GG 
CAGC7GGCACCC7GAGCC7GGAAAAGC7GCAGGCAC7CAAC7CCAACCCA 
7GAGA7CAGCCACA7GGGC7AC7CCCAGGGAAGCCCACAGAGGCAGGGC7 
G7C7AAGGCC77GGGAGCC7ACCCC77GAACCAGC77GCAGGACA7GGAA 
7CAAAGA77A7G77GCAGC777AAGGC77AA7G7777CCC7G7CAA777C 
AGGC77G7G7GGGACC7G77GC7777T7 77T 77777777TTT7T777GG7 
CACAGG7G777GAACCAGAACAA77CCA7C77GAA7AGGGGC7GGG7AAA 
A7AAGGC7GAGACC7AC7GAGC7GCA77CC7AGGAGG77AGGAA77C7AA 
G7CACAGGAGGAGA7AGGAGG7CGGCACAAGA7ACAGG7AGCGAAGACC7 
CGC7GA7AAAA7AAG77GCAG7AAAGAAGCCAGCCAAAAC7CACAAAGCC 
AAAA7GG7GA7A7GG777GGC7C7A7G7CCCCACCCAAA7C7CA7C7CAA 
A77A7AA77CCCA7AA7CCCCACA7G77GAGGGGAGGACC7GG77GGAGG 
rGATTGGATTATGGAGGCAATTTCCCCCATGCTGTTCTGGTGATACrbAG

7GAG77C7CATAAGATC7AATGG7TT7ATAAG7G77TGGAAG77CCTCC7 

ACACACATGCTCACACTCTCTCCTGCAGCTTTATGAAGAAGG7ACTTGCT 

77CC777C7GCCA7GA77G7AAG777CC7GAGGC77CCCAGC7A7GCAGA 

ACTGTGAGTCAATTAAACCCGTTTTCTTTATACATTACCAGTCTTGGGCA 

3TTCTT7ACAGCAGTGTGAGAACTGCTGGCGATGAGAGTGACCTCTGGTT 

GXCCTCAC7GCTCATTATATGCTAATTATAATGTATTAGCATGCCAAAAG 

ACACTCCCACCATGACCCCAACAGTCATGCCTGTGCCGGTCTCAGCACCA 

TGACAGTTTACAGATGGCATAGCAACGTCTAAAAGGTACCCCATATGGAC 

TAACAAGGGGAGGAACCCTCAGCTCTGGGAAGTGCCTACCTCGTTCCCAG 

AAAGC77G7GAA7AA7CCAC7GC77G777AACA7A7AA77AAGAAA7AAC 

TATTAAGCATCCTTAGTTCAGCAGCCCAAGCTGCTGTTCTGCCTATGGAG 

TAGCCATTCTTTATTCCGTTACTTTCTTAATAAAATTGCTTTTACTTTAC 

TGTATGTACTCGCCTGGAATTCTTTCTTGTACGAGGTCCAGAGCCCTCTC 

TTGGG7CTGGATCGGGACCCCTTTCTGGTAACATTTTGACCAATTTCTCC 

CTTCTGGAATGGGAATGTTTACACAATGACTGTATCACTTT7GAATCTTG 

GAAGTAAATAATTTGTTTTTGACTTTACAGCCTCATAGGTGGAAGGAACT 

7GAC7TGAATTTCAGATGAGACTTTGGACTTTGGGACTTTTGGGTTGGGG 

CTGGAATGAGTTAAAAGTTGGGGGGATTATTGGGAAGGCACGATTTTATT 

TTGCAATATGAGAAGCACATGAGATTTGGGGGACCAAGGGTGGAATAATA 

TGGT7TGGATG7TTGCCCCCTCCAAATCTCACATTGAAATG7AATCCCCA 

G7G77GAAG7GAGGCC7GC7GGAAAA7G777GGA77ACAAGGC7G7CGAG 

CACA77GGA7AAGACG7G7AGGNCCC 

>Contig33

CGCAGC7CGC7GG77AA77C7G7GGC7CC7G7GACCAC7A77A7AGCACC 
AGG7C7A7GACCAGGAGAA77AGAC7GGCA77AAA7CAGAA7AAGAGA77 
77GCACC7GCAA7AGACC77A7GACACC7AACCAACCCCA77A777ACAA 
77AAACAGGAACAGAGGGAA7AC777A7CCAAC7CACACAAGC7GC777C 
C7CCCAGA7CCA7GC777777GCG777A77A777777AGAGA7GGGGGC7 
7CAC7A7G77GCCCACAC7GGAC7AAAAC7C7GGGCC7CAAG7GA77G7C 
C7GCC7CAGCC7CC7GAA7AGC7GGGAC7ACAGGGGCA7GCCA7CACACC 
7AG77CA777CC7C7A777AAAA7A7ACA7GGC77AAAC7CCAAC7GGGA 
ACCCAAAACA77CA777GC7AAGAG7C7GG7G77C7ACCACC7GAAC7AG 
GC7GGCCACAGGAA77A7AAAAGC7GAGAAA77C777AA7AA7AG7AACC 
AGGCAACACCA77GAAGGC7CA7A7G7AAAAA7CCA7GCC77CC777C7C 
CCAA7C7CCA77CCCAAAC77AGCCAC7GGC77C7GGC7GAGGCC77ACG 
CA7 AC C7 C C CGGGGC77GCACACAC C77 C77C7ACAGAAGA CAC AC C77G 
GGCA7ATCC7ACAGAAGACCAGGCT7CTC7CTGGTCCTTGGTAGAGGGC7 
AC777AC7G7AACAGGGCCAGGG7GGAGAA77C7C7CC7GAAGC7CCA7C 
CCC7C7A7AGGAAA7G7G77GACAA7A77CAGAAGAG7AGGAGGA7CAAG 
AC77C777G7GC7CAAA7ACCAC7G77C7C77C7C7ACCC7GCCC7AACC 
AGGAGC77G7CACCCCAAAC7C7GAGG7GA777A7GCC77AA7CAAGCAA 
AC77 C C CT C77 CAG AAAAG A7GG C7CA7777CCC7CAAAAG77GC CAGGA 
GC7GCCAAG7A77C7GCCAA77CACCC7GGAGCACAA7CAACAAA77CAG 
CCAGAACACAAC7ACAGC7AC7A77AGAAC7A77A77A77AA7AAA77CC 
7C7CCAAA7C7AGCCCC77GAC77CGGA777CACGA777C7CCC77CC7C 
C7AGAAAC77GA7AAG777CCCGCGC77CCC77777C7AAGAC7ACA7G7 
77G7CA7C77A7AAAGCAAAGGGG7GAA7AAA7GAACCAAA7CAA7AAC7 
7C7GGAA7A7C7GCAAACAACAA7AA7A7CAGC7A7GCCA7C777CAC7A 
7777AGCCAG7A7CGAG77GAA7GAACA7AGAAAAA7ACAAAAC7GAA77 
C77CCC7G7AAA77CCCCG7777GACGACGCAC77G7AGCCACG7AGCCA 
CGCC7AC77AAGACAA7TACAAAAGGCGAAGAAGAC7GAC7CAGGC77AA 
GC7GCCAGCCAGAGAGGGAG7CA777CA77GGCG777GAG7CAGCAAAGG 
TA77G7CC7CACA7C7C7GGC7A77AAAG7A7777C7G77G77G77777C 
7C777GGC7G7777C7C7CACA77GCC77C7C7AAAGC7ACAGCC7C7CC 
777C7777C77G7CCC7CCC7GG777GG7A7G7GACC7AGAA77ACAG7C 
AGA777CAGAAAA7GA77C7C7CA7777GC7GA7AAGGAC7GA77CG777 
TAC7GAGGGACGGCAGAAC7AG777CC7A7GAGGGCA7GGG7GAA7ACAA 
C7GAGGC77C7CA7GGGAGGGAA7C7C7AC7A7CCAAAA77A77AGGAGA 
AAA77GAAAA777CCAAC7C7G7C7C7C7C77ACC7C7G7G7AAGGCAAA 
TACCTTATTCTTGTGGTGTTTTTGTAACCTCTTCAAACTTTCATtGATTG

AATGCCTGTTC7GGCAATACATTAGGTTGGGCACATAAGGAATACCAACA 

TAAATAAAACATTCTAAAAGAAGTTTACGATCTAATAAAGGAGACAGGTA 

CATAGCAAACTAATTCAAAGGAGCTAGAAGATGGAGAAAATGCTGAATGT 

GGACTAAGTCATTCAACAAAGTTTTCAGGAAGCACAAAGAGGAGGGGCTC 

CCCTCACAGATATCTGGATTAGAGGCTGGCTGAGCTGATGGTGGCTGGTG 

T1CTCTGTTGCAAAAGTCAAGATGGCCAAAG7TCCAGACATGTTTGAAGA 

CCTGAAGAACTGTTACAGGTAAGGAATAAGATTTATCTCTTGTGATTTAA 

TGAGGGTTTCAAGGCTCACCAAAATCCAGCTAGGCATAACAG7GGCCAGC 

ATGGGGGCAGGCCGGCAGAGGTTGTAAAGATGTGTACTAGTCCTGAAGTC 

AGAGCAGGTTCAGAGAAGACCCAGAAAAACTAAGCATTCAGCATG7TAAA 

C7GAGA77ACA77GGCAGGGAGACCGCCA7777AGAAAAA77A77777GA 

GG7C7GC7GAGCCC7ACA7GAA7A7CAGCA7CAAC77AGACACAGCC7C7 

G77GAGA7CACA7GCCC7GA7A7AAGAA7GGG7777AC7GG7CCA77C7C 

AGGAAAAC77GA7C7CA77CAGGAACAGGAAA7GGC7CCACAGCAAGC7G 

GGCA7G7GAAC7CACA7A7GCAGGCAAA7C7CAC7CAGA7GTAGAAGAAA 

GG7AAATGAACACAAAGA7AAAA77ACGGAACATA77AAAC7AACA7GA7 

G777CCA77A7C7GTAG7AAA7AC7AACACAAAC7AGGCTG7CAAAA777 

TGCC7GGA7A7777ACTAAG7A7AAA77A7GAAA7C7GT777AGTGAA7A 

CA7GAAAG7AA7G7G7AACATA7AATC7A777GG77AAAA7AAAAAGGAA 

G7GC77CAAAACC777C7777C7C7AAAGGAGC77AACA77C77CCC7GA 

AC77CAA77AAAGC7C77CAA777G77AGCCAAG7CCAA77T77ACAGA7 

AAAGCACAGG7AAAGC7CAAAGCC7G7C77GA7GAC7AC7AA77CCAGA7 

7AG7AAGA7A7GAA77AC7C7ACC7A7G7G7A7G7G7AGAAG7CC77AAA 

777CAAAGA7GACAG7AA7GGCCA7G7G7A7G7G7G7GACCCACAAC7A7 

CA7GG7CA77AAAG7ACAT7GGCCAGAGACCACAC7GAAA7AACAACAA7 

7ACA77C7CA7CA7C77A7777GACAG7GAAAA7GAAGAAGACAG77CC7 

CCA77GA7CA7C7G7C7C7GAA7CAGG7AAGCAAA7GAC7G7AA77C7CA 

7GGGAC7GC7A77C7TACACAG7GG777C77CA7CCAAAGAGAACAGCAA 

7GAC77GAA7C77AAA7AC7777G7777ACCC7CAC7AGAGG7CCAGAGA 

CC7G7C777CA77ATAAG7GAGACCAGC7GCC7C7C7AAAC7AA7AG77G 

A7G7GCA77GGC77C7CCCAGAACAGAGCAGAAC7A7CCCAAA7CCC7GA 

GAAC7GGAG7C7CC7GGGGCAGGC77CA7CAGGA7G77AG77A7GCCA7C 

C7GAGAAAGGCCCCGCAGGCCGC77CACCAGG7G7C7G7C7CC7AA7G7G 

A7G7G77G7GG77G7C77C7C7GACACCAGCA7CAGAGG77AGAGAAAG7 

C7CCAAACA7GAAGC7GAGAGAGAGGAAGCAAGCCAG77GAAAG7GAGAA 

G7C7ACAGCCAC7CA7CAA7C7G7G77A77G7G777GGAGACCACAAA7A 

3ACAC7A7AAG7AC7GCC7AG7A7G7C77CAG7AC7GGC777AAAAGC7G 

7CCCCAAAGGAG7A777C7AAAA7A7777GAGCA77G77AAGCAGA7777 

7AACC7CC7GAGAGGGAAC7AATTGGAAAGC7ACCAC7CAC7ACAATCA7 

7G77AACC7A777AG77ACAACA7C7CA77777GAGCA7GCAAA7AAA7G 

AAAAA7 C77 CC7AAAAAAA7CA7C77777A7CC7GGAAGGAGGAAGGAAG 

G7GAGACAAAAGGGAGAGAGGGAGGGAAGCC7AATGAAACACCAGTTACC 

7AAGACCAGAA7GGAGA7CTTCC7CAC7ACC7C7G77GAATACAGCACC7 

AC7GAAAGAAC777CATTCCCTGACCA7GAACAGCCTCTCAGC77CTGT7 

77CC77CC7CACAGAAA7CCTTC7A7CA7G7AAGN7A7GGCCCACTCCA7 

GAAGGC7GCA7GGA7CAA7CTG7G7C7C7GAG7A7C7C7GAAACC7C7AA 

AACA7CCAAGC77ACCT7CAAGGAGAGCA7GG7GG7AG7AGCAACCAACG 

GGAAGG77C7GAAGAAGAGACGG77GAG777AAGCCAATCCATCAC7GA7 

GA7GACC7GGAGGCCA7CGCCAA7GAC7CAGAGGAAGG7AAGGGG7CAAG 

CACAA7AA7A7C777CT777ACAG7777AAGCAAG7AGGGACAG7AGAA7 

77AGGGGAAAA77AAACG7GGAG7CAGAA7AACAAGAAGACAACCAAGCA 

77AG7C7GG7AAC7ATACAGAGGAAAA77AA77777A7CCTTC7CCAGGA 

GGGAGAAA7GAGCAGTGGCCTGAATCGAGAA7AC77GCTCACAGCCA7TA 

777C77AGCCA7A77G7AAAGG7CG7G7GAC7777AGCC7TTCAGGAGAA 

AGCAG7AA7AAGACCAC77ACGAGC7A7G77CC7C7CA7ACTAACTA7GC 

C7CC77GG7CA7G77ACA7AA7C7777CG7GA77CAG777CC7C7AC7G7 

AAAA7GGAGA7AA7CAGAA7CCCCCAC7CA77GGA77G77G7AAAGAT7A 

AGAG7G72AGGC777ACAGAC7GAGC7AGC7GGGCCC7CC7GAC7G77A7 

AAAGA77AAA7GAG7CAACA7CCCC7AAC77C7GGAC7AGAA7AA7G7C7 




GGTACAAAGTAAGCACCv^ATAAATGTT^ 2" 

ATTAT7TTATTTTTTT7TTTTGAGATGGAGTCTCACTCTGTTGCCCAGGC 

7GGAG7GCAG7GGCGCAA7C77GGC7CAC7GCAAGC7C7GCC7CC7GGG7 

7CACGCCA7777CC7GCCTCAGCC7CCCGAG7AGC7GGGACAACAGGCA7 

G7GCCACCATGCCCAGCTAATTTTTTTGTATTTTTAGTAGAGATGGGGTT 

TCAC7G7GT7AGCCAGGATGGTCTCTATTTCCTGATCTCATGATCCGCCT 

GCC77GGCC7CCCAAAG7GC7GGGA77ACAGGCG7GAGCCACCGCGCCCG 

GCC7A77A77A77A77A77AC7AC7AC7AC7ACC7A7A7GAA7AC7ACCA 

GCAA7AC7AA777A77AA7GAC7GGA77A7G7C7AAACC7CACAAGAA7C 

C7ACC77C7CA7777ACA7AAAAGGAAAC7AAGC7CA77GAGA7AGG7AA 

AC7GCCCAA7GGCA7ACA7C7G7AAG7GGGAGAGCC7CAAA7C7AA77CA 

G77C7ACC7GAG7AAAAAAA7CA7GG777C7CC7CCA7CCC777AC7G7A 

CAAGCC7CCACA7GAAC7A7AAACCCAA7A77CC7G77777AAGA7AATA 

CC7AAGCAA7AACGCA7G7TCACCTAGAAGG7T77AAAATG7AACACAAT 

A7AAGAAAA7AAAAA7CAC7CA7A7CG7CAG7GAGAG77TAC7ACTGCCA 

GCAC7A7GG7A7G777CC77AAAA7C777GC7A7ACACA7ACC7ACATG7 

GAACAAA7A7G7C7AACA7CAAGACCACAC7ATT7ACAAC777A7A7CCA 

GC7777C7GAC77AGCAA7G7ATTGA7GACA77A7GCA7GC77AGACC7C 

C 

>Contig34

G7A77C7A77CTCGG77A7AACAC^7CAC^G7GA777G7CA7A7€777C 

CAGGA777G77AA777CAC77C77CAGC7G777CCCCC77G77GGC7GGA 

AC7GA7777C7A7C77C7GGGAGAA7C77CAGCAAGCCAAC7CAGGA777 

G77GGG7GCA7777G7CAAG7C7AGGACCCAGGC7C7GGG7GAC7GA777 

CC7C7AA77ACCGAGCAA7G7AAAA7GAGGAAG7C7GA77G7G7AAAGG7 

G77AAAC7777G7G7GACGGCAAAAC777AA7ACCA7GAA7AGAGA77CC 

AGAA7777CCAAC77C7AACGGGA77CC777CAC7CCC7GACA77AGAA7 

G77AGAAAA7C7ACCACAAAACA7C7G7GAGGC7A7CC7ACAAGGCCCG7 

7777CAAAA7AGG77777ACAAGGA77GC7A777GGGA7GA7AG77TCAG 

AAAGGCGC7A7CAAAG77AA77GA7GA7G7G7GCAAGC7GAAAG77A7A7 

GTTAGAAC7AGCAG7GATTTCAAAAA7A7CCCT777AGGC7T777GC7AA 

7A7A7C7GC7CA7777CAAAG77CCCAA7A77A7AAAAC77777AAAGCA 

GAAAGAAGAACCC7CCAT7TC7GC7GGCCCC7TCCC7GT7CAAC7AAAAA 

G7A7777CCCAGGCAA7GCTA7CCCAGGAC7CACAC7CCA7CCA7CCATC 

ACC7ACCA7AAG77CTT7GAAGGGC7CA77C7GAGCGC77CC7GAG7GCC 

7GGGA7C7G77A777C7C7CCATT7C7GC7GC7GCA7GG7AG7CCAAG7C 

C7CC7CCC7777CCCC7AGGCCA777GAA7CA7C7GC7AA77GG7777CC 

7GA77GCCACGGAAAC77CC7CCA7CCC77CC7CACA7A7CAGCCACAGA 

AG 7 A7 C7 C C AAAAAG CAAA7 C7GG7GAC A7 GAAG C C C77G C ACAAAAC C Z 

A77CA77AC7GG77CCACACCTCC7T7G7GGA7AAG77CAAGC7CC7GAG 

7G7GGCAAGCAGGGCCCACCTGGAATCCCC7GCCC7CCTCTCCTATCCCA 

CGCA7CAA7C777CC7G7CTATT7GCAG77CCT7GAATGTGA7AT7C7TT 

CTAG7C7C7G7GC7T77GCATAACCTG7TC7TCC7GACTGGAAACTCCTT 

CTCC7CC77G7AG7T7GGCTAATTTC7AG7C77TCAAGAC7CAGCTCATG 

CTTCACCCCC7C7ATAACAAGTCCTTTCCCAAGCTGGGTGGTGGATGCTC 

C7C7G7GC7G7G7GAG7CTTGAACA7CC7CAGCAAACC7CAGCrr7G7T7 

GC77G7C7CCC77GC7GTCAA7GCACC7GA77CAGGGC7GGCA7A7ACTG 

77CACC7CCA7GAC7GGCTCA7GGTGG7GCTCCG7GAA7A7CA7CCACCC 

AAACGGA7GAGAGC7ACCATGCCA7CAC77GTGAC77CCATC7GGAGCTA 

ACC7CCCCCGACAGGAAAGCGT77CC77AGGAAAGAA7A7C777GGG7TA 

AA7AGAAG7AGAGAC7CACCAGAAGCAC7ATG7CCAGC7CAGAA7GAACT 

GC7CAG7AAGCAGCC77G7CAA7GAGGAGGCAGCAGGCCAGCCCCAGAGG 

CC7CAAAG7GGGAGAG7AGAGAAGCGCAG77CC7GCCACAAAGGCACAG7 

GGACACC77GC7CCCC7GGCTGGC7GGAAGCAGA7GG7GTCCACCTGCTT 

CCA7GGGAA77C7GCACC777AA7AAAG7777A7GGGACAGGAAGG7GAC 

7GGCA77GACA77G7AACGAGGAA7GGG7GG7GCCACC77TGC7G7G7C7 

7ACCAGAAA7ACC7G7GGCAGG7AAA777C7AGAGAGACCC7CCCA777C 

7CCCA7A7AGCAA7777GAAA7G7T7CC7GAGGGC777CCAAA77CATC7 

GGGAACA7AGGAG77CCAGAAAGA7GAAA7CAAAGG7GA7GG7A7GCCAA 

AGAAAG7AGC7777AGAA7GAC77ACA77AGCCA77CA7CCA77CAGCAC 
ACCAGGCATTCAGTTTGAGGGGTGTGTGTGTG7GTGTGCGCGCGd3CG xG"

CG7GCA7GAG7GCA7GCGCGCGCG7G7ACA7AGGGGAAGGGAAACAAAAC 

AAAAGTACACAAGACATGATAGTTGTCCTCAAGGAGTTTTTGCAAATG7T 

CACAA777AAGAGAA7A7GC7G7GC7G7GGC7GG7G7A7AAACCAAC7GC 

7AGGGAGAGGCC77CCACACACAC77GGGGCAAA7GCGACC7C7AGGAC7 

GCCAGTGGAATCTGGGCATGCTGTTTGTGGTCGATAAACCCTGGTCCCTT 

GATCAGGGACCTATGTTTACTTTTCCTCTCCCTGGAAGTCTTCATTAGTG 

GGCA7CCAGAAGG7C77GCACAGGGCAGAGGGAGGCACAAAGACAAGAG7 

77GAAAC CAGCCTGGACAACAAAATGAGTTTCTATCTTTACAAAAAAAAT 

TTTTAAAAAATTAGCCAGGTAGGATTGCATGTGCCTGTAGTCCCAGCTAT 

TCAGGAAGCTGAGGCAGGAGGATTCCCTGAGACCAGGAATTTTGAGGCTG 

CAGTGAGCTATTAAGTTGGCGCAAAAGTAATCGTGGTTTTTATCATTAAA 

AGTAATGGCAAAACTTTTAATGACAAAAACCGTGATTACTTTTGCACCAA 

TTTAATATGATTGCACGACTGCACTGTGCTCCAGCCTGGGCAACAGAGTG 

GGACCCTGTCACAAAATAATAAATAAATAAAATGTAAACATGTAAAAAAA 

ACCCCAAAAACAAAAAAAATGGGTGTTGAGACCCCTGAATTGAGGAATAA 

TAGGAAGGAGTGTGATTCTGTGTGTGCATGCATGGGTGTGCACCCTCAGT 

GCCTGGGTGGCTTACCCTGGGCTAGTTCAGGTGGCAAATGGTTTTCCTCC 

AGCTGGGC7ACCACCATCTTCCCCCAGGGCCTGTCCATGTATTTGGTGGC 

AAGATACCTATGGACTAGAGTCCCTCCTCAGAGGAAAGGCTCC7CCCATT 

TCTCTGGCTTTCAGGTAGTAGTCCATGACTTCAACAGGTCCCCAGTGCAA 

TGTTATGGGTTAGTTTAGGTGGGGTCTCCTCTGAGAGCCTCCCATAGCCC 

AAAAGGCCC7GTCCTAGCTGGCACTGCATCTCCCTCTTCCCAGCTCTCAG 

CC777C7C777GC7CA7CCCAC7CCGCACAGGC777C7GCC7GA7CC77G 

GATGTGTCAATCCTGCCCCTAAGGGATGCAAGGCAATTTGTCCTTTTATT 

ATTAAGATCTCTCCTGAGGCCACGTGTGGTGGCTCACACCTGTAGTCCTA 

GAACTTTGGTAGGCCAAGGTAGGAGAATTGCTTGAGCTCAGGAGTTCCAG 

GCTGTAGTGAACCATGATTGCACCATTGCATTCCAGCCTGTGTGACACAG 

CGAGACCCTGTCTTTTTTCTTTTTTTTTTTGAGACAGGGTCTCGCTCTGT 

CATCCAGGCTAGAGTGCAGCGGTGTTTTTCTGCTCACTGCAGCCTCAACC 

TGCACATTTTTTGTAGAGACGGTGTCTTGCTATGTTGCCCAGAGTGGCCT 

CAAACTCCTGGGCTCAAGAGATCTTTCCACCTCAGCCTTCCAAAGTGCTG 

GGACTACAGGCGTGAGCTACCGCGCCCAACAAAGACCCTGTCTTAAAAAG 

AAAACAAAAATAAACAACTCCCTCAAGTCTTTTTTTTTTTTTTGAGACGG 

AGTCTCGCTCTGTCGCCCAGGCTGGAGGGCAGTGGCGCAATCTTGGCTCA 

CTGCAAGCTCTGCCTCCCGGGTTCACGCCATTCTCTTGCCTCAGCCTCCC 

GAGTAGCTGGGACTACAGGTGCCCGCCACCACGCCTGGC7AATATTTTGT 

ATTTTTAGTAGAGATGGGGTTTCACTGCGTTAGCCAGGATGG7CTTGATC 

7CC7CACC7TG7GA7CCGCCCGCC7CGGCC7CCCAAAG7GC7GGGA77AC 

AGGCA7GAGCCACCGCGCCCAGCCAGACC7C77GAG7C77AAAC7CC7C7 

G7AG77CCAGCCACCC777AGCACA7GAC7C7G77AA7777G77C7CAC7 

G7C7GAAA7CA7C7CC7G7CCAC7C77GAC7GACAGG7C7C7GCAC7AGC 

CCAC7GC77AA7CAGAG7AGG7CCC7G7CAAC77A77CA7A77G7G7CCC 

CA7GCCAG7G7GGA7GA77AAAA77G77GAG7GGAGGC7GA7CAGA7GAG 

CCA7C7CC77CCAAG7CC7CAC77GC7GGC7CC7G7C77AG7777AG7CC 

CCA77C77CAAAGAACG7GAGCCC7GGAAAG7A7777AG7CA777AG77C 

AG7GCC777GGA7GGGAGGA7CACA7CCC7GGG7CCCG7CC7GCAGAC7G 

7777GC7C7AGC7GAC7AGGCAGGA77CCC7GCC77C7C7CAC77CGGCA 

7GGGAC77CC77C7GAAA77GC7GC7CAG7CAAGAGAA7GACC77CCCCA 

ACA7AA7CC7AC7CCACAGGGAC77AAAGG7G7G7CAGAGA7C7C77GC7 

CA7C777C7GGCCAGG7GCCAACG7CAG777A7AGCCAAGGGACAAGAC7 

AG77AGCAGA7CAGGCAGG7C77AGACCCCAGCG7AAG7GCCAGAC77C7 

AGC7GCAG77G77CC7GCCCACAC7GGGCG77CAGG7GGAGAGAGGGCA7 

GGCAC7ACAC7GAGC7C7CGGCGAAACCCAGGAC7C7GAAA7C7CGG7G7 

CAGCCACAGGCCAC7C7777CAGCAGGAC77CAG7CAG7CC7G7CAC7AG 

GC7G7CGAGCACA7GG7AGGC777ACCCC 

>Contig35 

AAGGAG7G7GC77GC7GA7AGCA7G7G7GANGGGACGAGGAG7AAA7AA7 
77C7GCC77CAAGAAA77GCAAAC7AG7AA7GGAGA7AAAA7CAACAGAG 
GAACAA77AGAG7A7AAGG7AAAA7C7AAGGGCCA7AAGAGAGGAGAAGA 
AGTATGGGAGTTCAGAGurAGGGGGTAAATGAGGGGAGTAGGTGGGTA(^

AAAGGTTAAAAGTAAATAATGATGGGAAGGAAGACAAAAAGACGACAGGG 

GTGCCAAAGGACTCTTAACCTCATC7GAACGGAGTTGCCCTGTTTTGCTC 

7CTGATGC7CATG7A7C7A7CC77AGAGACAGC77GGCGGGCAA7G7AGA 

GCG7AGGGGC7GACA7AGGGGG77GGAG7CCCACC7CCG7GACTTCTAGC 

AAATTAGCAAACTTTGCTGCTGCTAAGCC7ATAAGGCGGACAGAAATGCC 

A7C777AAAGC77GT7ATG7AAAG7GCC7AGGACC7CG7AGGCATCAACA 

GGAA7AA7GGA7GAAACAAAACAACGG7GCG7A7C77GGAGAAAG7GGCA 

TCTGAGCAGGAGTATTTTGAAAGGTAGGAAAGGGCTCCAAGCACATCTAA 

GAGAT7AGGGAACGCAGAAGCCTTAGCCCTGGGTGCAGATTTAACCAATC 

AAC77C7AACCACCGCAGGC7GAGAGG7G7GGAG7GAGAGCCCCGCCAGA 

GGCAGGAGACCCGGGCTTCGGCCAGACCCCGCCTCCTGGTACAGAGGACC 

ACGCCCGGCTCTGCCTGGAGCCAAATGTGGATCAAAACAGCGCGCAGCTT 

CCCACTGCTGGTGAAAACCCGAGCAAGGGGCCTCAGTTTCTTTATCCGGA 

ACGTGGTGACAATGACATCTCTTTGCAAGGCTGCTGCAGGGCTTTCTGGA 

AATACGCCCGTGAGGTATCTGGGCCTGCGCACAGCCTCCCCCGCCCAGGA 

CCCAGACGTCTACCTGGGGGTCCCGTCTGCGCTCCCGGGATGGAAAACGC 

CCAGGGGAAACTTAGGCAGGCGAGCGGACGGGCACCTCCCGCGGGACGAA 

CTCAC7CGGTGGCCTCCTACTTCCCCGGCCGTGTTCCAACGCCTGAGAAT 

AACGGGAACAGCGGTCGTACTCACCGACAGCGGCAGCAGCGGTAGGCCCG

GGCCCCACCATGACTCTTCAGTGACAGTTTTTCTTCAAACGCCGCSCCTG 

TAGCCAGGACCGGCGTGCCGCGCG7CCACGCGTCCTCATTGGCTCCTGCG 

GG7T7GAAAC7CGCTAG7CG7CAGCACGGGAGGGCGGGACAACAGGCAA7 

AGGC7C777GCGGTTGGCTCTGGCCTTGAGAACCCGACCT7GGGGCCCTT 

TGA77GGAAGAACGTGCAGCGCACC7CGGCA77GAGGGCGGC77CC7CGG 

GGCGCGGCGCCGCCCGCC7C7GAG7GCGCC7G7GAG7GCGCC7CCGAG7G 

GGCG7GGGACCC7CCGTGGGGGCC7CAGCCGGGC7GG7GG77GGGGGGCG 

G77ACGC7GAA7CCAGC7GGGG77GGCGCGCCGGGAGTCCC7GGGCGGAG 

AGACAGGGCGG7CCTCCCAGGA7GC7GGGGCCGC7ACC7GA77C7G7CC7 

77CAAAG7C7CAGACTCACAGGAGC7G7GAAAAAA7AATA77ATAAAGAG 

GACA7A7GGG7C7TATGCATC7AAAGGC7CC7AG77CTTAG7AC7GCAGG 

G7GGC7CG777AATTGTGG7AAAA7A7GCA7AACA7CACA7A7ACCAT77 

7AACCA7777AAAGTGT7AAA7T777CAAAAA7G7GCAG777AG7GGTA7 

7AAG7ACCC7CACATTG7GGCACAGCCACCAC7AC7GTCC777CCAGAAC 

77777CA7C77CCCAAA7GAAACCC7G7ACCCG7CACTAAC7CCGCAC7C 

C7CCC7CCCCCAGCCCCAGGCAA7CACCA77C7AG777C7G7C7C7ATGG 

A777GACAAC7G7AGGTGCCA7A7AAG7AGAA7CA7GCAG7A777G77C7 

37GAC7GGC77G77TCAC77AGCA7AAAG7A77CAAGG77CA7CCA7G7G 

TAGCA7G7G7CAGAATT7CC777CC7777AAGGGGGAA7AGCA777CG77 

G7G7GGAGA7GCCACAT7T7GC77C77GG7CCA7CCCTC7CCGGACAC77 

GAG77GC77CCACTTTT7GGC7A77G7GAA7AA7AA7A7GAACA7GAA7G 

CACAAA7AAC7C77TGAGAC7CTCC7777CA77C77TTGGGTATATACCA 

CGAAG7GG7A77G7TGGATCAAACGGCAA7TC7A777T7AAT7TT77GAG 

AAAC7GCC77AC7CCTC7CACGG7GA7CTCT7G77CAAGG7A7ATT7TCG 

A77TCACC7GA7CAGCTGACTATAAGGCCATAAGGCTAACGGAGAAACGC 

AGGCC7AG777CTCCTAGTTACTAGGAGATCGCAGGCCTCGTTGTCCTGA 

A7CCC7AGACACACTTCATTCCCC77G777TAA7CC7AAA777T777TC7 

777GAAG777G7CCTGT7TCATC7A77C7CCAG777CT7AAAGAGG7C7G 

GAAAA7GC7777GGCTCCTTG7G7A7GAAGG77CC7CT7CCA7GGA7GC7 

GGAGAAG7CG7G7GTGGAGGGGCAG7CA7A7C7GGGCACC7G7TGGCCAG 

G77CAGC77ACCAGTTGGG7AC7CAGCAGGGCA7GAAGCCAC7GCAGCAG 

CCC77C7C777AGCCGTAAA7AGGGAG777GGAAGAGAGCCAGGG77TC7 

GGA777A7GCA777TGA7A77TTCAA7AG7G7A77AAA7G777AAAATAG 

GAAAAC7GA7CATTATTTT7G77AA7GAC7GAGAAAGGGACTCC7TCACC 

AACAG777CAGAAAAG7GAAGGCGG7777G7777GG7C777G7AGAA7C7 

AGG7GG77GAA7GCATGTCAGT7G7AGAAG7CACC77GCC7GATA7CCCA 

CGCAGTGCTGGAGTATTCCACAGACCCCATGTAGGTACTGCACCTTTGCA 

GG7A7AC7GC7GGTGTTGG7GAGC7GCC77ACC7G7CC7G77A77GGAGA 

CCCC7GC77A77AGGAAAC77AAAA7GAAC7CAAA7GAGC77CC77GC77 

AC7GG7CC7AGTCCTTTGGAGCAACATAGGCCAGTTCTGCCTCGTTTTTT 
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TCCATCCTTTGGGTATTTGACGGTCTATTTTG7AGGACACAAAATG7GGG 
AAAA7AGC7AGGCAGG777AAAAA77C7CAAC7C7ACCAAGCA7GG7GGC 
TTATGTCTGTAATCAATCCCAGCACTTTG7GAAGCTGAGGCAAGAGGATT 
3C773AGCC7AGGAG777GAGACCAGAC7GGGCAACA7AGCAAGACC7CG 
T77C77AAAAAAAAAAAAAAAA77ACAAAAA77AACCAGGCA7GG7GGCA 
CACACCTGTAG7CCCT7CTACTCAGGAGGC7GAGG7GGGAGGATCACTTG 
AQCCCAAAAGT7GAAGGA7GCAG7GCAC7G7GG7CATGCCACCGCAC7CC 
AGCA7GGGAGGCAGAGCAAGACCC7G7C7CCAAA7AAA7ACA7AAA77AA 
A77C77AAC7CA77CA7CAAAG7A7CCAC7G7AGC777CCA7CA7CC7GG 
TG77GT7TT7777AGAAGGA7C7GGC7CCA77GCCCGGC7AGAG7GCAG7 
GGCA7GA7C7CAGC7CAC7GCAGCCCCCACC7C7C7GGC77AAGCGA7CA 
CCCAC77CAG7CACCCA7C7GGG7AA77777G7A777777G7AGAGA7GG 
GG7777GCCA7G77GCCCCAGG77GG7C77GAAC7CC7GGC7CAAGCGA7 
CCA7C7GCC7CCA7C7CC7AAAG7G77GGGA77ACAGG7G7GAGCCACCA 
CACCAGGACAA7CC7GG7GGC7777AACGG7777CCA77GC7C7CAGGC7 
AA7GACC7A7AAGCCCC7GCGGGC77GGCC7777AC7CCC7GAGCA77AG 
CCACC7CCC77AGCC77AGCCCACAC7AC7C7CCCC77GC7CAG7G77A7 
CCAGACAC777G777777CC777CCA7AC7CC7C7C7G7C7GGGAA7CCA 
ACC7T7CTT7C7CA777C7C7AG77GA77A77A77A77777AC7C7AGCA 
GCC77A77GAGA7A777ACA7ACCG7ACGA77C7CCCAC77ACAG7G7AC 
AA77CXAT777C7AACA7777CA7CACCCCC7AAAGAAACCC7A7AC7CA
77AGCAG7CAC7CCCCA77C7CCCC7CC7C7CAGCCCC7AGAAACCA7GA 
A7C7AC7A7CCA7C7C7A7AGA777GCC77C7GGACA777CA7A7G7A7G 
AAA7TA7GCAA777G7GG7C7C7GA7GGGC77C7777G77ACCAAAA7A7 
CA7GGG777GA7C7AGG7CC7GC7GC7CGC7GCACAGAAAGCCAGCCAC7 
GAGA7GACAAG7A77GCCAAGGAAGAAGGC777AG7CAGG7GC7GCAGC7 
GAGGAGA7GGGGGC7CAA7C7CAAA7CCA7C7CGC7GACC7AAAACCAGG 
GG777GGA7AGCAGGGAAGAAA7G7AACAA7GCG7AAGAAAACAGGAACC 
AGGGAGGGGCAAGGAAGCAA7CC7GA7GAA7GAG7GG7CCAAAG7C7CA7 
TGCC7GGA7G7GG7GA7C7GGCGAG777CAG77C777GA7AC7777777G 
AGAGGCC7GAAG7C7777CCCCAGGAAGGAAC7CAAACAAAACAAA7ACA 
AGC77CCAGC777AAGACCAGAAGCG7CAA777C7A7G777A7CCGAAAG 
AACAG7C7A7GGGAC7A77GG77AAG777CAC777CAC77AG7A7GC7G7 
777CAAGG777A7CCACA7AGCA7G7G7CAG7AC77CA77C7777A7GAC 
7GGG7A77C7A77G7GCGGA7A7ACAA7A7777A777GCCA77CA7CAG7 
7GA7GGACA7C7AGG77C777CCAC77777GGC7A77A7GAA7AA7GC7G 
77A7GAAC777CA7G7A7AAG77777G7G7AGACA7A7G7777CAACAC7 
CA7GGG7A7A7ACC7AA7GAGAGGAA77AC7G7G7CA7ACGA7AA77C7A 
7C777AACCA777GAGGAAC7GCCAGAC7G7777CCAAAGCAGC7GCAGC 
A7777ACA77CC7ACCAGCAG7G7A7GAAAG77CCAG777C777ACA7CC 
7CAACAACAC77G77A77G7CCA7C7777AAA77ACAACCA7CC7AG7GG 
77G7GAAA7GG7A7CACA77G7GG77777A777G7A777CC77GA7GAC7 
AA7GA7G77AAGCA7C77777A7G7G777AC7GGCCA777G7A7A7C7C7 
A77CAGAG7C777GCCAA77777AAA77GGG7CAG77G7C77C77CC777 
777777GAGA7GGAGCC7CAC7C7G777CCCAGC7GGAA7ACAG7GG7G7 
GA7C7CAGC7CAC7GCAAC77CCACC7CC7G7G77CAAG7GA77C7GG7G 
CC7CAGCC7CCCAAG7AGC7GGGA77ACACGCACC7GCCACCA77CCCAG 
C7AA7777777C777G7A7777GAG7AGAGACGGGG777CACCA7G77GG 
CCAGGC7AG7C7C777G77GAC7C77AACCA7CC77CAG7C7CAGACAAA 
ACA7CCC777C7CAAGGA77G7GA77AGC77GA77A777GC77A7C777C 
7CCC7GC7AG7C7G7AAAC7GAGGG7AGGCCAC7A7A77CA77G77C77G 
GCACCAAA7AGAAAC7AAA77AA7G7C7777GAA7GAA7AGGGC777C7C 
C7777AAAGA7CCC77CAA7ACAG7AACCACAC7A7A7A7AAG7AGCCAC 
AAGCCCA77CAA7AA7AC7AC7AG7NC77GCGCCAAACC 
>Contig36

GGC7CAGCG77AC7A7AC7GG7C7CAAAC7CC7GGGC7CAAGCGA7C7GC 
CCCCC7C3GC77CCCAAAG7G77GGGA77A7AGGCG7GAGCCACGG7GCC 
7GGCC7CAAA7AAC7A777AAG7GAAACAAAAC7AG7A7GGCAC7AA7GA 
AAAA7G7A7AAA7CCA7AA7CGCAGAGGGA777CAAC77AC77C777CGA 
77A7G7AAAGG7CAAACAGACAAAAGACAA7GACAAAAC77AA7GCAA7G 
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AACACTTTTGATTTAATGAACATATATTGGATATGTAC CCAAGAXTTAGa 
3AA7ACA7AC7AG7777GAG77TATGCAGAACA7T7ACAAAAA777AG7G 
GAAGCC7AAA77A7AAAAAG77GC7G7CACG7AGAA7AACACACAAACCC 
ZTGAG7CCGGAATTCAAAGCCCTCCACACTCTCCTCTACCTTTGCATCTT 
7A7CC7CCACCACAC7GCAG7GCA7AC7C7GGGC7AC7AC7CAC7G77C7 
TGA77CAAA77CCA7G77C7G7CAGC7CAAA7CA77C7C7C7GCC7GGAA 
7A£C7AC77CA7ACA7A77C7GC7A77GAA77C77G7C77AGCACCCCA7 
C7AC7CCAAGACGA7G7CCAG77GGGG77AC7CCC7G7CCCA7777C777 
GA77ACAC777777777C7AC77CCA77A7A77A77GA7CACA7C7G7GC 
CACAG77777GAC777G7G7C7GC7777AC7C7777C7AGACCC7GA7AG 
C7CC7GAAGGG77GGG7CA777C777777A777GC7CA77CC7CA7GGCA 
CAG7GAG7GC77AA7AAA7GGC7A77GAC7GAAA77AAAC7G7A7C7AAA 
7GGACA7A77CC^C77C7GGGCCA77CA77Crr77C777C7A77GGAACCA 
GGAGA7GGGGAACCA7AACAAAGG7AAGG77G7GCCA7G7GAAAGAACA7 
GGAACC77CCCC7GAGGGCCAAAAAAGAGCAGGGAAAGG7GCAAAGACAA 
AA7C77CCA77777AAACAA7G7AAGAA7G7GG7CCACC7CATGC7CAGG 
7GGGAC777A7CA7GACG77A77777GGGGAC77A7AGC7GCA7CA777A 
CCCCA7A7ACA777ACC777AG7G7AGGGAAC7GAGGACAGGAA7777G7 
7GA7GCAGAC7C77GC7AA7GAGGC7AACAC77GGAGAA77777A7CA7G 
CA77CAAGAAGC77G7777ACA777C77CA77AA7AC777AG77GG7GG7 
77AGC777AG77G7AGGC77A7CAGA7A777GGAGA7A7C77CA7AAACG 
A7GGC777GG7777AGAAGAG77A77C7GAAGC7AC7A777C7GGCAA7A 
A7CAAACAGCA7GGCCA777G7777G7AAGGCC777CC7AGAA7A7GACG 
G7AAAA7C7ACG7G7GGAAAAA7GC77A77C77C7G7CC7C7A7AAA7G7 
GAA7C7AG777G7C77CAAAA7GAAA7CAAG7GA77AAAA7G7AG7777C 
TAAGAAGA7AAA7GGAGCAAAGCAC7C7G7G777CACAG7G77GGAAA7C 
AC7CA7CCC7CA7AAAAC7G7CCCAAC7GA7CC7GAC7CACA7GAA7GAA 
77AAAA7AAGAG77AA7AACA7CAA777ACA77777AAAGACAC777CCC 
A7G7777AGAC7A77GG77GGAAAAGC7GG7AGG7G7ACAA777G7GGAG 
AG77GGC7G77777G7C7G7CG77G777GACG7A777CAAAGCCA7A7C7 
AA7777G77GCAGAA7GG7C7GAA77C7ACAAAAA7G77GAG77G7G7AG 
7G7GGAGAAG7ACGGAGCCA777AC7GAAAGGC7GGGGGGAAA7GACGAG 
ACCC7GAGA7AAGGCAG7AG7GG7GCGAACAGAG7GGAAGGGAGG7AG77 
GAGA7A7G77CAGAG7AGAA7CAGAA7GGACA7AG7GAACAAC7GGA7GC 
AGG7GGGGGC7GAGGAAGCAAAG77GAGGA7AA77C7GAGAC77C7AGG7 
7GA7CCAC7GAAG77ACA77A77CAACACCACAAGGAAAC7AGGGGAA7G 
AGAAGGCA7AC7GG777GC777GGAG7GGAAGGGCAG7GA7G7AAGAGGA 
G77AA7GAG77AAAG777GGA7A7GCC7GAAC77CAA777GA7A7G7GCA 
7C7GA7A7ACCG77GGGG7GACCC7CCAGGCAA7GG77GAACA7G7G7A7 
77C77AG7AAC7GA7AGGCA7CACAGAC7CACA7CAG7AAGGAAGCAACA 
GCAAAC77GA77GGACGA7A7ACC7GGAAC7CAG7ACCC7A7GAC7GGAG 
CAAG7C7C7G7CAG7GAAA7GAGGA7AAGAAGAA7C77GACC77G7GGAA 
7ATG77G77AGGAA7A7A7G7GA7GAACAACA7AGGA7AC77CC7ACAGG 
GC7CCACA7G7AG7AAGGGC777A7AAA7GC77GA7AAA7A77A77G77G 
7AA777A777CCAAAG7AAGA7GCCAC7GGAGGAA7C777GGAACCCAAA 
77AA7AACAAA7AGGAC7GGA7GCAA7GGC7CACACC7G7AA7CCCAGCA 
C777GGAAGGCCAAGGCAGGAGGA7C7C77GAGCCCAGAAA77CAAGACC 
AGCC7GGG7GACACAGGGAGACC77G7A7C7A7GAAGAA77AAAAAAAA7 
7AACCAGA7G7GG7GG7GCACGCC7A7AG7CCC7GC7GC77GAGAGGC7G 
AGG7GGGAGGA77GC77GAGCCCA7GAGG77GAGGC7GCAG7GAGCCA7A 
A77G7GCCACCACAC7CCAGAC7GGG7GACAGAG7GAGACCC7A7C7CAA 
A7AAA7AAA7AAA7AAA7AAA7AAA7AAG7ACAAACCAGCAAACAC7AA7 
CC777C7AGAGA77AT7GAAC7C7GGAGGGCAGA7C7GAA7GGAGCCAGC 
AGAGGGACC7A7GGAGA7CAGCC7GGCCC7GGACAGCACCAGGCAA7GGG 
G77GC7AGAGAGG7AA7GGGG77GAACAGGG777AAGCCA7GAGG7C7CA 
AGAA7CCG7GAAGAC7CAGAC7AA7777777777777GCA7GAGGA77AG 
G7G77CC7AGGAA777CAA7GAGAGCAGGG77AA7GAAGGAA7GCAGGG7 
AGGAGAGC7GAGGGAAGGCA7C7GAGAGAGCC7GGC77A7GAA7GGC7GC 
37CAGTATGGC7CACC7GCTTTCCTTGTATCTACTTAGCAGATGATCCCA 
GCCGAGGCC7CCAGGGCCAAGG7CA777CCACA7AG7CA7GGGCCC77GA 
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GGGCCTGGAGCAGTGTA^oGAAGACAGAGTCTTAAGAAATTGCA^^AC., 
G7CA7GG7GC77GGCAAG7G7CG7CA7CC7A7GCCAAGCC7GA7C7GAAG 
GGG7GCA7GC7CATAGG7AGC7GC7GCCCAAGA77ACAGCAGC77C77CA 
ATCCCAGATCCATGCTCTCCTATATTCATTTTTCCAGGGGTTCCTGTCCT
7CGACAG7GA7GAGA7GCAGAA7GAC77A77GAG77A77C7CC7GA7AG7 
TGCCAAC7777CCAAA7GACAA7GGGGCA7GGAGC77GAGAG7GGAAA7G
AGGCCC7AGGGA7AGCG7GC77AGGAAAACAC7CCCAGCC7GA7G7AA77 
C7GGGGG7ACAA7GGCA7777CA7CA7CAAGAC7GA7G7AAAGGG7GAC7 
AGCAG7GAG77GGGGG7GAC7CGCAC7GGGGC7AGG777C7GA77C7GCC 
TAA7CCAGACAGAGCAGAAGCAC7AG7GGGC7GG7AGAGGGCC7CCAGGG
CC7CAC77AA7G7CC7GGAAAAACAGC7CCAGA77G77GG77CACG77C7 
GAGGACAAGC77GGG7AC7ACAGGA7AGAGAGAG7GG7GGGAGA7GCCG7 
GGCC7GCCCTGCTGATGCC7GCCCTGCCATTCCTGCGTGTGA7GTCTCTG 
GGGCA7C77GCC77CCC7GCCCAGACC7G7AG77CAGC7GAGGGCA7G7G 
GAGGCCAAA7GGC77C77AGAG7G77AC777CC77GAACAGC7C7GC7GG 
GAGAAC7GGAGGAGC7AGC7AG7CACGG7AAC7GCAGCAG7CAAAGGA7C 
G7CCCGG7GGAGG7GGGG7GGAAAGG7AGAGAAAGAGAACA7A7AGCG77 
T7CC77GGAGA7G7G7GGGCA7G7CA7AGAGGAAA7ACCCAA77CC7GAG 
CC77GAGCCC7CCAGGAAACC77GGAA7A77AGG77AG7CA7CCCCAAGG
AAG7C7AAGAA77C7GG7C7CACCCA7C7CC777AA77CCCACAA7GA7C 
C7ACA7GA7A77AAGGAACACGGGCCAG7AAGGG7CCAAGCAA7GGA7G7
3G7GG7GAAG777GACC7CA7GA7GGAGCGGAGG77GG777GAAACC7AA 
GAA777AA777A77G777CAAAC7G77C7CCAC7CAGCG77A77AAAGCA 
7ACA7AA77GACACA7AAAAA77G7A7A7G7C7ACGG7G7ACAA7G7GA7 
G777CGA7C7A7G7A7ACA77G7GAAA7GA77ACAACAAGC7AAA7AACA 
7ACCCA77CA7CG7G777CAAAGGAA77AAAC7CAAGCACAAAAGAGAGG 
TGC7G77GAAGAG7AGGGC7GC7C7A7C7AAG7AG7A7G7C7GGGG77G7 
CC7GGA7CAGGG7CC7777G7GC7AG7AA7AAACCAGCCC77C7GGGGC7 
GC7CCAC777CCCCACA7777C77C7GGAGCC7CCC7AAGAA77AGGACA 
7GGCCAC777C7C7GCA7AGGC77CC7AC77CAACAAGGACAGGGC77G7 
GC7GCCCCA7GCCAC77GAG7G7CCC7ACAGCACAGAGC7GAG7GCACAC 
7GGC7GAG7GAGGAAA7CCCCCAGA77AA7C77GG77C7AAGCA7CA7GG 
C7G7A777CACACG7A7A7GAA77ACAAA77ACAGCA7AG7CGAA7AAGG 
A77777G7GC7ACAAC7GGAA7CCCAGA77A7GCAAA77GGA7AG7A7AA 
TA77GAAA77CC7AGGAC77777A77AG7777AAAAAA77A7ACAAGC77 
AGAG7AAGAAA77AAACAG7GCAAAAGAA77CAC7G7GAAAAG7AAAA7G 
C7C7G7C7C7GC7GAGAGACAGA7A77GCAGCCCAGA7AC7AC7GGGG7C 
AA7AG777CC777AAGCA7GCCA7777GA7GG777A7GGGAC77ACAGC7 
~AAGAAGC77GACAC7AGGG77GA7C7CAGAAAA7CA77G77GCAGG7A7 
7AGA7A7GACCG7C7CA7AAAGA7ACACACACAGACACAGCGA77GGAGA 
TA77CAC7GGGGC77A7GGGC7GC77G7CC777C7GC7C7G7GCC7AAG7 
7GGGC7CAGAG7AGCC7GGCA7CGGC7G7GGGGAGAA7GC7GGCA7GGGG 
77AGCAGGAGCCCAC77AACA7G7CC7AAGCCACC7GGAAGAG7CC77CA 
AGGAGACCAGAC7CCAGAGGCCC7AAGGAAGGAAGGAC7777GCCCG777 
77AGG7A77C7AG7CCCAGAG777AGGGAGGAA7GG777GGC777GGG7C 
G7G7GCCCC777ACCGAG7GGGA7GGGA7GTGCCCA7GAGC7G77GAGC7 
GGC7C77GGAGAAGACAGCAAAAGCGGGAA7AAGAGG7CAGGAAGC7G7G 
7GG77G7AGGAAA7CCCAGCAGAGGGCC7GGGGG7CAAAAG7GG7CA7GG 
7AG7GACGG7GGAGGC7GAGG7GG7AGAAAA7CAGAGGACAAACCCCA7G 
GGC7GC7GG7GA7C7GACCGAGC7CC7A7GC7C7CC7GG77CA7777AGG 
C7C7G7AGCAGCAGA7GA77GGC7GG7G7GAGAGCAG7GCACC7GCCA7A 
7CAGGCAA7CCAAGACAAG7CCAAGC7ACGC7GGGAGGAAACC7GAAGGC 
AGCAGCAGG7AGAC7GGC7GAAGACAGACAGGCAGGCAAC77G7CAA7CA 
GA777G7G77777AAGGAC7777AAC7GGGGAGCCC7CCGGGACAGA7CA 
GA7GAGAG7GAAA7G7GC7CCGCC77AGCC 
>Contig37

GGCCG77CGCAA77C7G7AAAAGGGAGAG7GG7777A777A77777AAAC 
A7AG7CAAGC7GC7AAAG7A7A7GA7A7G7A7AGA7AGAG7A7AA77AAA 
7AC777CAAC7ACAGACAAAA7CAGGAGAA7GGAA77AAAAAACAA777A 
CAAA7GGG7AA7GGCAGCA77GGG77GCGCCCACCCACGAGAAGGCAGAC 
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ACCAAGA77CTAAGA7C^ACGTGGCCAGC^ 

77CG7GA77A7GCA77A77777C7CGGAAAG7777CAC77CAC7A7A7GC 

TACTTGACACTTGCTTTCCTAAGACATCCCTCTATTTTTGAGATGACTAA 

C7CAGCAA77CA777C7C7CACGCA7AAGC7G7CAC7CAACCCAAACCCA 

CCAAGCC7GCA77C7ACCC7CAA7AAGG7C77GG7G7G7AAAC7GACCCA 

CTTCACCTAGTTCC7TAGCCCTCTCTTGACCAGACATGACTCTTTCATAA 

GC7AGACC7A7AAAG7CAGGGC7C77AAG7AGC7GA7C7C7GATAG7GCC 

AX&TGTCCCCCAC7GTTCACATTTTCCAC7CCAGCTTCTAACAGGTGATA 

GAC7GC77777GGGGG7AGGGGCACCAAAACA7A7AGACC7CA7G777GG 

A7G7AGACAC7CCAG777C777AAA77ACAAC7ACA7A7TAA7AA7GAC7 

7CCAAG7G7ACA777CAG7CCAGA7C7C7CCC7GGA7CCCCAAAC777G7 

AAAACCCACCGCC7AG77GA7A7C7777GA7G7C7GACAGGCA777CAAA

T77AA7AC7G7CACAAACAAAG77A77GA7777CA7C7C7GCA7C7G77A 

CAAA77777C77AC777GG7AAA7AGCACCCCAGGC7G7G7CAC7GCCAA 

GAAC777CCACAGC7C77GGAA7AAAA77CAAAA7A7777CCAAGGCAGA 

AAGGCACAG7G7AA7C7GGC7CC7GCC7ACC7C7CCAACC7CG7A7CACA 

C7AG7C7CCC7G7CAC7CACCCCC7CCAGGAGC7CAGG7A7CC77AAAG7 

77C7777C77777777777777777777777GAAACAG7777GC7C7G77 

GCCCAGGC7GGAG7GAAG7GGCA7GA7C7CAGG7CAC7GCAACC7CCGCC 

7CC7GGG77CAAG7GA77C77G7GCC7CAGCC7CCCAAG7AGC7GCAA77 

ACAGGCGCG7GCCACCACACGGGGG7AA77777G7A77T77AG7AGAGA7

GGGG777CACAA7G77GGC7AAACCGG7C7CAAAC7CC7GACC7CAAG7G 

ATCTGACCACT7CAGCC7CCCAAGG7GC7GGGATTACAGGCGTGAACCAT 

7GTACCC7GCCTCCTTGAAGTT7CTTGATCCAGACTCATTCCTGCCT7AA 

GG7C77GCA7C77CAGTCC7CCCC7CAAATGACACC7CCA7GAAGACGCA 

A77ACC7G7AA77ACCG7G7C77A777AG7CAA7G7G77GG7777C7G7C 

7CC7CCAC7ACAG7G7AAGC7C7A7GAAGGCAGAAACC77GGCAG7CCAG 

T7CCCAGCACAG7GCC7AGCACACA7AGG7A777AATAACACACAG7AAA 

A77CACC7777AG7G7GCAA77C7GAG7777GACAAA7GCA7CAAG7CA7 

77AAG7C7GAC7A77A7CAAGC7A7AAGA7GG77GCAACAC7A7CAC7AA 

77CCC7CA7GC7CC77GG7AG7CAG7C7CACCCC7AACGCCCCCC7CC7G 

GCAA7CAC7GA7CCG777777G7C777A7AG7777GG77777CCAGAA7G 

CCAA7AAC7AAG7777GAA7GAA7GAA7GC7A77AAC7C7CA777C7GAC 

7CCAGAGCAACA7CCA7GCAA7A777A77A777CAGCCCCAAA7AC7GCC 

CCC7CACC77CAC7CCAACCACC7ACTTGA7GA7ACAAGG7GAGACA777 

GGCA7G7GC77CC7CCA7G77CC7AGCA7777CCC7A7C7CC77AGCC77 

CC77C7AA7CA7AAACGAAGAGTGAAC77TCCCTTTCTAAAGGCAAC77A 

C7CC7AGGACC7CGATGCCATAATTTTGTTTCTCTAGTACTTTCTATATA 

7ACACCAAACAA77AGC7CCAGAAAGG7AAAGAC7CAC7G7G7GC7CA7C 

AC7G7G7C7CC7AGCGCC7GGCACAC7GCAGG7GC7GAAGAAACACC7AC 

AGAA7GAG7GAA7GAA7C7C7CCC7C7C7AGAC7CC77C7C7777G7AA7 

CAAACA7G77CAACC7GCAACACAG7C77A7GACCAA7CC7C7G77G7C7 

GACC7AGGC7GAGC7CCAGGGC7GGGACCC7GAC77CC77A77CACCACC 

7CAAGG7C7C7GCAC7CAC77C7C777C7GC7CAGGA77G77777C77C7 

7G7CACCAG7C7777C7CAGAC77AGG7C7CAGC7CAGACA77GC7G77G 

AAAGTACTTCTACTGATCCTTTTATCTAAAGCAGCCATTCCAGCCCTACT

C7C77GA7CA7AGCACCC7GAA77AAG77G777ACT7AC7G7C7C77CAG 

GAGGGCAAGGAGC77GG7GG7GG7G77CAGGGC7G7ACCAAGC7G7ACC7 

7GC77CACCC7GC7ACAC77777AGCAACCA7C7AA7777ACA7GC7CCC 

T7CAC7CG7CAGAAA777CC77A7777C7AC77CAAGCAGG7A7ACA7A7 

G7GC77C7CC7GGGAGGC7CACCCAC77CA7GAGAC7ACA777GG7CC7G 

GG7AGAAAG7G7ACAAAA7CCAC7GGC7CAG7777AATCAA7G7A7G77A 

A7A77AACCAACC7GAGA7C77GA777CCACGCC7GGC7AA7777G7A77 

777AG7AAAAACAGGG777C7CCA7G77GG7CAGGC7GG7C7CGAAC7CC 

CGACC7CAGG7GA7CCGC7CACC7CGGCC7CCCAAAG7GC7GGGAC7ACA 

GGCA7GAGCCAGCG7GCCCGGCC7AAGA7C77GA77TC7ACCA7C7GAAC 

7C7G7A777GAAC7GAC7GC7CC7GC77GAGC77AC7GGCCAAAAC77GG 

CCCAC7CAGAC7CACGGAAG777C7GG77C77CCC7GG7AAC7777C7GA 

AC77AACCAC7GG777GC77GACAAGAGA77ACCA7C77C7CAC77CC7A 

3C7A7G7GAAC7CAC77A7C7GC7C7A77GC7G77CAG7C7AGCACGGCA 
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CTTATTGAACGAGTGTCTACATCTGCACCCCCTACTTCTTACTCATCCAT 

7CTG7TTCAATT7CTTAAAAAGAAAAAAAAAAAGCTATTGTAAACATACG 

ATTACAGAAAATGATTTATAACATGTGTATGTACCACCTAGCCCTGTCAA 

G7C77AA7A777G77A7A777GC77CAAA7C777777CAGAC7G7AG77A 

AAAA77AC77AGGAGCCATTA777A7GGCC7A777CC7GACC7AG7C77C 

77GA7GG7CAA777GCC7AA7CA7C77AAG77GCAAAAGC77AGAA77AA 

AGgAAAG7ACC77CGA7CC7C7GC7G77GCC77C77777AA7A777GGG7 

7TG777GGG7CCCATTTACGG7TG7GACATCAGC77GAG77T7GGGAGC7 

G7C77G77CAGAAAA7GG7TC7GGGGAACAGCC77777CAAC77GGAG7C 

CAAAG7C7GTGCTTTTTGCTGAAAGCCATTA7TG77ATGT77ATTACCAC 

7GG77CCA777GG7C77A7GC7AGGGG7GC77GGAA7GGC7GAA77AAA7 

C7GCCAAC7G7CAAA77AGGCC7C7GGC77ACGGC7777GAC7777GCAG 

TACACA7GA7G7C7GAGGTATACAAAC77GGC7GGAC77C7GA7C77GC7 

7GA7G777GGA7G7C7G7TGTTA7A77CACCC7GAAGCAAAC7GGGGTA7 

G77C7GGG777GG7GTGC77CAC7C7C7G77CAG7AACAGGG7A7GACCG 

7A7C77AG777CA777GG7C777CA7A77GAC7CC7A77AACC777A7A7 

CT77GA7G77C77GAC7AC7GGT77C777GATGAC7GAAC7T7AC7AAGG 

GTCCGAA7AAAG7GAGAGGGAACCG7CC77GAGGG7777ACTCC7GGTC7 

TGCAAGA7C7GC7CC7C7AGAGAG77GC7G7GA7777AC7GGGAAAG7CC 

7GC777G7G777C7CCAACAAA77G777A77AACCC7A7C777CAGAACA 

GCAC7A77AAC7GAACT7TTGCCCAAGGC77G777AGGAACTAAACTG77 

C77GG777GA77A7AAGAG7CAG7C777GGC77AC77C7GGTA7A7AA77 

7AGGA7C7GGCTTCCTCTCAGGT7C7G7TAAGA7A7C7AGCAAG77CTC7 

77G777G777C7777AGAAAG77A7CCAAAGA77CG7777CAACA7GGA7 

ATTAT7CA7AAAGTCTATACATTTACCATT7CC77GATCTGTTAACTGC7 

GC777G7AG7777CAATTGCTCTA7A77AAG7GACCCCACAGG7777C77 

GACAG7CC7CC7G7GG7GGAC7A7C7AGC77CACAC7G77GAAAAC7C77 

GC7GAAAAGC77AGAC7ATGGGT7AGAAGAAACACA777TGAAG7CCGCC 

7TT7GCCCAGAAG777TGG7GGC7C7AAC77CAGC77C7GGGACCC7GCA 

GTA77AGG7GG7C7GGGCTGGAG777AA7GC7GA7GGACCTTT7AGGTTT 

GACAGGCAAAACAACA7GG77GG7AACA7CA77777GGGTC7AATAG7C7 

GAAAAAACAAAGAAAA7ACATA77AAAAAA7CC77AACA7ATC77A77G7 

7T77AAAA7AA7AACTG7G7TTAACACA7GC7AAAAAAAAAA7CA77TTT 

AGAA777CA7C7AAGAAAG7TGAATCC7CAGAAAG7AAAGAAAGAC7CAC 

7AA7AGG7AG7777TGTGT7TTT7777777777777777GAGACAGGA7C 

7TGC7C7G7CACCCAGTCTGGTGTGCAGTGATGCAATCTTGGCTCATTGC 

AACC7C7GCC7CC7GGG77GAAGCAA77C7CCCACCCCAACC7CGCAAG7 

GGC7GGAC7ACAGGCGCA7G7CAC7ACACC7GGC7AC777777G7A7777 

7AGTAAAG77GGGG7TTCACCATATTGGCCAGG77GG7CT7GAAATCC7G 

ACC7CCAG7GA7CCACGCACC77GGCC7CCCAAAG7GC7GGGA7AACAGG 

TA7GAGCCACCACACC7G7CC7AACAGG7AG77777ACAAC77GAG77CC

7A7CAGAAG7A7A77AGAA7C7777AGC77GACAGAA77AAGCAGAGA7G 

CAG7GAA7A7ACAAAAC77GC7C777CAAAAA7GAA777GCC7CAAACAG 

7AG77G77GAA7GCC7A77A7A7CC7AAG7GCCC7CCAAAGAACCC7GAA 

AAAA7ACA7ACA7AA7GAAC77A7G77AGGGIACC7CCCAACAAA7C7C7 

CC7AG7AC777G7A7AGCCACAC7A7A7G777777AAACCAC7GCC777G 

7AAACA7CACAG7A7CAC7CAAGAACC7C7G7C7CA7CCC7GGAGA7CAG 

7GACAAGGAGA7AGG7GGCAGA7GA7G7GAGGCC7GAGA7A7GC7GCCAC 

AGC7C7CAA7AAACA7G7AACA7C77AA7AG7CA7A777G7AAAA7CAGC 

CAGGACAGGG7777AAGG77AGAG7C7A7G77AA7AA7AAACAAA7G777 

AG7CA7G7GA777AAG777GGA7AAGAAAGG7AGGAC7CGA77ACAGAGA 

A7777GAAAAC7AGGGAAGGGAG777AGAA77CA7A7GG7AAG7AA77GG 

GCAAGCCAC7A7GAA77CC7GAGCA7C7C7CA7GAAAGCAA77AC7CAGA 

AAGGAGAA777CACAGAGA777A7GGAA7A7G777CCAGGG7AAGA7A7G 

GGAA7GC7AGAG77ACCAC7C7A77777GA777GACAAA7A77G7GAAGA 

A7CAC7ACA7AAAC77GGCGAG7A7G7AAAGGA777C7AACCAGAACCA7 

77GGCA77GAGGGCAAAGAAA7G7C7AC7C7GGA7GA7AGCGG7G7G7G7 

GG7G77AC7AGGAG7GAAACAGCGGAG77GGGAG7GGGAGGCAGAGAGA7 

GGA7GG7A7ACCCACAA7GGC7A7A7C7GGA77AA7C777GAGCACCAAC 

A777A7A7ACACC7CGGA7C7C7CCA7CA77GC77AC7GAAGAGG7GGAG 



FIG. 3 (35 of 52) 







GGACGTTGGCATGAAAGLiTCCAAATGTGTTTTTTTAGTTGCTTTCTTAr- 

A7A77AAAAACGAA77GA7A7AA7CCACAAACCA7AAAA77CACCA7777 

AGTAAGTGCACACTTCTGTGGATTTTAGTATAGCCACAC7ATTATACAGC 

AATCACCACTGTCTAATTCCAGAACATATTCATCACCCGTAGAAAGAGAC 

TTGGGTTTACTTGTTGGCAGTCCCTCCCCA 

>Contig38 

GGTCTACATGTGCTCGCAAGATTGGA7ATTGAAATATCAGCAAGAAATTA 

AA7GACA7AG7AG7CA77A7GCC7AAA77A77G77A777777GA77GAAA 

AAAGTTGAATATTTCAAATATCAAGGTAGTAGTGAGATATAATAAAGAGA 

GAGTCAGTTCTAAGTATAGAATTGCTGATTCAGTTAAGCTCTGTTCTCCA 

ACA777GGGCCACA77GAAGAGACCA7G7AGC7GC777CAGCC7CGG777 

CC7CC777GCAAAA7GGGGA77ACAC7ACC7GCC7CACAGAGA7G7AAAC 

TTATGACATGTTATCATGATTGCCAGGGCCCACCTGTTTTCTTTTAAACA 

TTGAAATCACTGTGCCTGAAACAGGGATTTCCCTGCCCTTTGTGCAAGCT 

CCAGAAACAGGAGTCAGCCTGAGTCCCGCAGCTAAGAACGTGGATTCTGG 

TCATTTTCTCATAGCGAACACACTTCACAGGTCCTTCAAGGGAGTACATT 

7TCCTATAACTCACCTTAATCTCAGTTGAAGCCTCGTTTCTTATTTTGCA 

CTGTGGCCAAAAACTAAATCTCATTTCTTTCACG7AAACTTCAGCAATTC 

AATAATAGTACAGTCATTTTATGTTTCAACTGAACCAAGTCAGGGTTCCA 

CTCCTGCCTCCCCTTTCTGCTCTGAGGACATCCATGAAGTGGAGGGGGTC 

TATGTAGCCTGGAGCTATTGGTGA^GGCGATGGGTCCGTGGTGGTCTTG 

GGGAACTGCGGGGCTGTGTCTGGCTGGTCTGGTGTC7GG7GATTGGCCTT 

G77CCACGCGG77CACGC7GCAGGACAG77CG7G7CC77C77G7CC7AA7 

GA7CAGC7777AGGC7CACGGGCC7G7C7C7GC7GAGA7A7GGAA7AGGA 

CAGCC7C7GGA7C77C777AAAC7C7CC7GGGGCCACAGGGGAC7C7G77 

7G7G7C7G7GCCCACA7AGGA7GAT7C7GCCCAGACC777GC7GCCA7T7 

CTTGC7GTTCTGCTGTTTTTAGTCTCTGGAGGGCT7GCAGTTTCCTTGGG 

G7CCC7G7GGAAGCAAAGCAAAG7CC7C7CCACGC7CAGA7G7C7AAACG 

7A7C7GGG7777A7CG7CCACCCA7CCCAGAGC7CAG7C7AGAGGAGGGG 

GCAGCC77CGGG77C7C7CC77CC7CCCAGAGCC7C77CC777GCACCAG 

GGCAGCCTC77CC7A7C7G77GGAAAGGGC7G7C7GG77C77GAA7A7AG 

AG77GCAGG777GAGGGG7G7AGGC7GAGG7AAGGCAAAC7A7CACA7GG 

AA7AAAAA77ACCC7G7G7CAAGGAACAACCAGAGC7GGACAG77777AA 

A7G7GAAAACCAA7777A77CAGGAC7A7GGCGAGAGG7GAAG7AAGACC 

7CAG7A7AGAAC7GGGC7CAA77CCGAA7GCAGCA7GGGCAAA7GGGAA7 

G7A7AGCC7AGGAGCAGGG7GGGAACC7G7GGA7GAAGAA77AC7AAAAG 

GGCA7A7CAGGGG7GAGGGGGCG7CC7GGC7ACACCCAC7AAC7AC7G77 

GC7GAAGAAAGGCC7GG7GACA7CAC7GGGGAA7GG7GGGGGA7GAAGAA 

TCCAA7CAGA7GGA7A77GAGGA7AAGGGGA7C77GA7AAAC7GGC77AG 

3AGGG77T77GC7AAAAC7GG7777CA7AGG7AAG7CCACAGACAGG7C7 

7GGAGAAAG77CAGGGACC7ACGG777G77CGGGCAGA7GC777G7CA7C 

7G7CACAC7GGCAC7G7CACC7GGC777CC777AG7CCC7CCCCCCC777 

7777777C7GGAG7AG7777GGGAGACCAGAGGAGCAGGGAG77AGGGAG 

AG7AG7CAGAAAAGGCCAGAGAAAA7AAGGAGG7G7C7G7AGGGAAAA7C 

C77AAA7CC7C7AA77AAAT7AA7TTAA777A777A7C7GGGACAAGG7C 

7CAC7C7G77GCCCAGGC7GAAG7GCAG7GG7G7GA7C7CGGC7CAC7GC 

AGCC7CGACC7CAGGGC7CAAGCAG7777GCCACC7CAGCC7CC7GAG7A 

GC7GGGGC7CACAGG7G7GCAC7ACCA7GCCCGGG7AA77777GGG7777 

7777777777777777777777777777G7AGAGA7GAGG777CGCCA7G 

77GCCCAGGC77GG7C7CGAAC7CC7AAG7GA7CCA7CCACG7CGACC7C 

CCAAAG7GC7GAGA77ACAGGCA7GAGCCAC7G7GCCCGGCC7AAA77C7 

CCAA77T77AAA7GC77CCC7G77CCC7G77CCAGA777GGGA7A77GAC 

7GC7G77AAA7CAGCGA777C7CCC7G7GGAGAGG7AGCCAA7AGGAAGC 

AACAAGAG7GAGGAG7CC77A7A7CGAAA7AGAGGG7AAGAGAAGAGACA 

GA7G77ATC77GGCAG7GA777AAGAACAGCGAG7C7G7AAGCAAAGCAA 

AGCAAGGC7CCCAGG7GC7GAGAAACAA7GGC777C7GGGGAAGCG7C7G 

7G77CAGAACC77AAG77GGAAACA7C7C7GAAGA7G777GCCA7GAAGG 

7777C77C7GAAG77GAG7C777CA7CAC7AGG7AGGCG7G7777GGAG7 

C7C7A7CAAACAGA7CC7G7G777A77AGGAAGC7G7GG77CA7AAAGCC 

2CA7GC7AA7777GCAGG7AGCAGGG7GGCCC7GGCC7GACCCGGGGACA 
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GAG7GGC7G7CC7CCC1 ^CAGGCAGGAAACTCTCTCCTGCCAGCTAGTw j • 

CTGCATACCCACATTTCAAGGGAGCTTCTGGGTGGTGAGTTTACCAGACT 

ATGGTCT3AGGTAGAGTTAAGCAAAACAAAACTAAACTGCATAAAGAAAC

AGAAAGAAAATCAGG7GTTATAAAAACAATTTGGCATTTGT7TG7GTTTC 

AGC7CCG7G7CGA777AT7GC77CCACAAA7AG7GCCGA7A7GCACCAGG 

CAC7G77G7AAAAC7GAAAA7A7G77T77GGA7G7GCCCAG7C7G7GAG7 

A77AAACGA7GG77GA777GAAA77TGC7A7GA77CA7A777C7GGGGG7 

AAGA7GCAGGA777C777GGGGGGCC7ACGA7G7GGCA77C7AGAA77C7 

CAAAGAATCAACCC7GG7GGGACCAGGAAGAGC7GAGC7GAGGCC7C7C7 

GC7CATG7G7AC77AC7GGAGA7CA7GGAGACAGG7GAGCC7GAG7GCAC 

GTC7CACCAAAGCCACAGCAGAGGGGGAGGAGGCGGAAAGAGAGC7C7C7 

CCA777CTGAGAAG77AA7GG7AACAATGGCATACA7ACC7AC777ACAG 

T7GAAAT7GGAAACCACAGCA77AAG7G777CCAA7GAAA777GGCAA77 

TGGGAG7777C7GAGC7GCA77GGA7G7GG7777GCA7GC7G77AGGA7G 

AGCAAGAGA7GA7GGAGAACA7C77CC7777GAGC77CC7C77GGACG7G 

GG7CAC7CCCAC7CA7GGAA77AGAAAGC77AGACCTAGAC77GAA7C7C 

ACC7TCTCAAGG7GC7CCCGGGCAAATCAC7TAAGA7CCA7C77C77C7C 




AAA7GACACGG7AC7GG7AGAAGAAAAGG7CCAAG7C7GC7777ACAGC7 
CCCC7CATCCCCAAA7G7AC7CCGACCCCAAGA7GACCA7G77A7CA777 
GA77GACA7CC77C7AG777CAAC7CA777C777GCA7G7A7A7GCACG7 
ACA7A7ACAC7A77T7A7777GCCAGGGG7CACCG777AGC7GCA77AA7 
T7C77A7AAAA7AA7C7A7A777AC77A7GG777ACG7AAAACAACA7AC 
ACA7GTAAG7GTATAGC77GA7AAG7C77CAC7G7AAACCAAAAA7AAAA 
77CGAAGCCCCCCCAACCG7C7GAA7GGACCCC7C77C77GGCCAAGAGC 
A77CCAAAG77AACC7GAAAAAAC7AG77CAGG7CA7GA7GGAAGGGAAG 
G77GGACATGCCCCAG7A7ACCC77C7CCC7777GGAA77CAGGAAAAGC 
7GACCAGCA77AACA7CAACACAGACC77A7G7C7GA7AGGAAAC777GA 
CAA7C7A77CCC7C7GAAGC77GC7ACCCGGAGGC77CA7C7ACAAGA7A 
AAACC77GG7C7CCACAACCGC77A7CA7AACCCAGACA77CC777C7G7 
7GAGAA7AA777ACC77G7AACC7GGAAGC7CCC7GCT7CAAG77CCC7C 
ACC777CCAGA77GAACCAA7G7AAACC77ACA7GCA77GA77GA7G7A7 
7A7G7C7CCC7AAGA7GAA7AAAAGCAAGC7G7A7G77GAC7GCC77CAG 
CACAGG77G7CAGGACC7CC7GAGGCTGGG7CACGGA7GCA7CC77AACC 
T7GGCAAAA7AAAC7G7C7AGA77GAC7GAGACC7A7C7CAGA7AC7G77 
GGG77CAAA7ATA7AAC77A7GAAAC7AA7ACACAAA7CAAG7CA7AGAA 
7A777CCA7CAC7CC7CA7C7ACCCCCAAA777CC77A7GCG7C777GCA 
G7CAACC7CCCACCCCA7CCCCAGGCAAC7GCAGA7C7AC77777G7C7C 
TGCACC77CAAC7GACCC777C7G7GA777CA7A7GAA7GGAA7CA7GCG 
C7GAGCAG7C7777G7G7C7GGC77C7777GC7CAGCA7AA7G77777GA 
GG777G7CCA7G77T77G7G777G7CAA7GG77AA777C7C7CCA77GCA 
GAG7AG7777C7A77G7ACA7G7G7ACCACAA777G7A7A7CCA77CCA7 
7GC7GA7GGACA777GA777G777CCAGA7777GGCAA77A7GAA7AGAG 
C7ACCA7GAACACCCAGG7ACAAG7C777G7G7GGAC77A7G7777CA77 
7C7C77GGAA7GGAAC7G7CA7A7CAA7AAG7A7A7G777AAC777G7AA 
GAAAC7GACAACAAA77A7C7GCGA7GG77A7GCCA7777G77777C7AC 
CAGCAA7ACACGAGCA777CAG77GC7CCACAAC777GCCAAAAC77G77 
77C777AA777GGACA777AAG7GG7G7ACAGAGGCA7C7CA77GTGG77 
C7AG7777C777GCCC7GA7GACCAA7GG7G77GAACA7C777TCA7G7G 
C77777GACCA777ACA7A7CC7C7777G7GAAG7G7C7G77CAAA7A77 
77TGCCCA777AAAACA777GGGGG777G7C77A77A77G7G77GGGAGA 
G77CCA7A777A777A777A77GAGA7GGAG7C7CAC7C7G77GCCCAGG 
C7AGAG7GCAG7GGCG7GA7C77GGC7CAC7GCAACC7CCAC77CC7GGG 
77CAAGCAA77C7CC7GCC77AGCC7CC7GAG7AGC7GGGA77ACAGGCA 
7G7GCCACCACAC7GGC7AAG77777G7A77777AG7AGAGA7GGGG777 
CA7CA7G77GGCCAGAC7GG7CGCAAA77CC7GACC7CAAGCAA7CCACC 
7GCC7CGGCCC7ACAAAG7GC7GGGA77ACAAGCA7GAGCCAC7G7GCC7 
GGCCCA7A777A77T777A77C777A7777G7A7ACAAG77C77GG7CAG 
A7ACAA7AA7ACC7GG7CAGA7GAGA7AA7GAG77GGAAAA7GC777GCA 
AA7GGGGGAGAA7AA777AAA7G77A777A777A77AAGAGCAGAGGCCC 
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TTCCrrGTTGCGGTCAC-jAAGCCGTTTGCT 

AGCAGAG7CGAGCTACACAGGC7G7C7G7GT7GGC7GC7AT7AG77AA7C 

AGAGAG7777777777C77GCC77G7CA77C7AA777G7GACACA7AA77 

AGCCACAATATGTGTTTTCAGTTGTGACACTGGCC7GGGAAACCAAGGGA 

7G777AGAG7GGA777CC77GA7777GCAA7AA77G7G7G77777C7GCA 

TC77CTG77AAACACAAA77CA7GGAAGCAAAACA7GGAAGCAAAG7ACC

Z7GGACA7CCCCCC77C777A7GAAA77GA777C7C77AAA7G7AA7G77 

7GC77G77CCC77AC777AAAAGCAA777AAGAG777A77GAGAAAG7GA 

GCCC7GGAAACA7AGA7GCA7AGAGAGAAAA77C7ACCACCC7CAGG7CC 

C7A77G7C77C7C7CA7AAAG7G7AG777CAGGGCC7777AGAAG777C7 

777C7GC7C7GA777GCA7G777G7GAG7G77GC7AT777AAG7A7T7GG 

A777GG7C7GCAAA7CC7A7GAGAGA7GGCAACAGAG7AGGGA7C7CAAA

GCC7GCAGG77G7A77AAG7CCAGCAGGGCC77G7A777ACAACAGAGGG 

TCC77GAAGACA77CCA7A7A77A7GC7AGGGGAG7GGCCAAGCAAAC77 

7AA7G7G7CCC7A7GG7GGGA7A777GGGG77AA7ACC7GCCC77C7C77 

^j^j»rr»rr» ^ *^"3! ,, X I X7 C7777 C7777777 C777 7 7 C"*77 C77777777777GAAA 

7G7AG7C77GC777G7CACCCANGC7GGA77GGAG7GCAG7G<37A7GA7C 

7CAGC7CAC7GCAACC7CCACC7CC7GGG77CAAGCAA77C7CC7GCC7C 

AGCC7CCCAAG7AGC7GGGAC7A7AGGCACACACCACCA7GCC7GGC7AG 

77777777777777777GAAACNGAA7C7CGC7C7G7CGCCCAGGCGGGA 

C7GCGGACTGCASXQGCGCAA7C7CGG

>Contig39

GGC7CGCA7CCC7CA7A7CCA7GAG7G77C7G7GGGCCC7GCC7C7GAAA 

7AAA7C77GCC777G7C7CCCAG77CAC7CCAGCCACCCA7CC7GGGGC7 

GCACCC7CC7CC77CCAAGCCC7C7GCC777CC77CC7GG7GC7GCC7G7 

CATG7CAAGCATATGCATCAGTGCGACCAGGACATTTGAAATGCAACCAG 

7ACAA77GGGCGCGG77A7GCC7ACCAG77777C77CC77AAACA7777A 

7A777A7G777GAAAGCA7GCCACC777C77CAC77GCCAAC77GACAGA 

777A77AG77GACAACA7CCGC7GA7AGCA7CAG7AA7AAG77AA77G77 

777GCACA7G7AGC777AA77A77C7CA77A7CA777A7AGGAG77A77C 

777G7AAAGGG7AAC7GAG7777CCAAAACAAACAGAAA777GGGG7GGG 

CCCA7GGAGCG7GAC7CA7GAAA7CAGA77C77AGAAGGACC7CGGCAAG 

7C7C7GGG77GC7G77AA7GAGCC7GGC7GGC7GCCAGGGG7G7G7C7GC 

CC777A7GAGGCCACCAC7G77CAAA7GC77GCC7GCAGCA77AC77GCC 

7AGG7AG7GC77G777C7AC7GAAC7G7CAGGGA7CCAA77C777G7GG7 

C7AAG7AACAA7AC7CAGA77CACAAGGAA77GA77AA7AAGCCAGAA7G 

CCAA7G7A77ACA77777GA7GAAGACCA7A777ACAG7GA77G7A7C7G 

C7CAAGC7CAAA77AGGAT7AGAG77C7GACAAA7ACA7A7G7GAGAAG7 

A7GAGG77AAA7AC77GAAA777GGAC7777C7AGAAAA7C7GAA7G7GA 

77GCCA77CACA7ACC777C7GGGGA7GA7GA77C77G7AC7777A7777 

AAAAGACA7AGAAAAC7AAC77AAGAA7CAGA77GC77GGC7GGGCACAG 

7GGC7CA7GCC7G7AA7GCCAGCAC777GGGAGGCCAAGG7GAG7GGA77 

GC77GAGC7CAGGAG777GAGA7CAGCC7GGGCAACA7GG7GAAA7CCCA 

TC7C7AC CAAAAA7ACAAAAAAAAAAAAAAAAACAACCAAAAAGAA7AAA 

77AGC7AGG7G7GA7GG7GCG7GC77G7AG77CCAGC7AC77GGGAGGA7 

GAGG7GGAAGAA77GC77GAGCCCAGGAGGTGGAGG777CAG7GAGC7GG 

GG77GCAACAG7G7AC7CCAGCC7GGGCGA7AGAG7GAGAC7CCG7C7CA 

AAAAAAAAAAAA7CAGA77GC777A77GC7GG7777C777C7AAAAC7GA 

GA77GGG7CCCA7CA7CCCC7GGCCCCCA77GG77AA7GG77CC7CC777 

G7C7A77GAA7AAAA7ACAGA7G7C7GC7777GGCAACA7GG77GAA7G7 

AGACACTGCAGGG7C77CC7GAC7CAAAA7GAG7AAGGC77AGA7AAAAC 

ACA7777GAAA7GCA777C7GGA7GAACAGCAAGGAAAGGAGA7C7C77A 

AAA7CC7C777C7G77CCCC7C7CCC7ACCCCC7CCAAG7GGGC77AAG7 

AGGAAGGG7GG7GAGCGGCAGG7AAACACACG7CAAAGGCAG7C77CC7C 

TC7GAGGGAAAACAC77G7A7AAGCA77GCAA7CAA7GGGCC7C777AA7 

7A7G7GCCAG7GGCAAGAGCGGG7GC7GAACCCAGGGGCC7GCC7CAA7C 

CGGGGCC777GAGGCAGAA7AAAG7GG7C7CAGG77G77GGCA777CC77 

GCCG77CCACCCGAAGCAGACACAAA7CC7C7C7GGAGGCAAG77CCCCA 

AT7CAGCCAG7ACAACTCCCACAGAC7AAGATCAATCATGTACAAGCTCA 

CAGACAAAGG7CACCAAACACACAGAGCAA7AAACAAAT7CA7GAG7GAC 
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37GAA7GAGAA7AAACAv_ ^AACAATAACCACCAGCTGGGATGCTCT.AAG • 
CTTCAGCTGTTAGAATTCCTGAATATAGAATAAAACTGCCACAATGGCAA 
ACATGCATCTAGTACTTACTGTGTGCTGGGTTCrrAAGAATTTTGCACATT 
GTGCCAGATACCGACTCAGC7TCACAC7CACCCTCCTACTGTGCCCTCTT 
AATTTGCACTAGATTAAAAGGTAGAAAGGAAGAGGCAGCTATTCTG7TCT 
TGGCTG7GCCTCTGGCAGCACATGCAAAATGGGCAG7AACAGTGGCAGTC 
ACAGG7AAG7AGCC77C7CACAG7G7GGAG77AAAGGCA7GGGAC7GAGA 
CGAGCAAGG77CC7AAAGGGACAG7GGCCAG7AGA7GACCAGGGGC7AC7 
GGAG7GGC7GCA7GGC7C7G7GGAAGC7CAGAGGAGCC77GGG7CC7GCA 
GG7GCAG7AGCAGCT77C7G7AG77CC7GA7C7C7GGG7CCCACAA7C77 
CCCCG77777GC7CC7CCAC77C7AA7777G7AAC7GAC77CCC7G7G7G 
7AC77C7C7C7C7GA77GAAA7AGCCAGAC7GG777C7G777CC7GA7AA 
GACA77G7C7GG7ACGAACACAG7AAC7CA777AA7CCGA7A7C7C7ATG 
AAGGAGG7ACAA7AA77A77CC7A7777ACAGA7GAGGAAACACAGCAGA 
GAAA7AAAG7CAA77G7C7AAGG77GCACA777AG7CAAGGGAAGGG77G 
A7A7AACA7A7AA77A777AGAAAACA7C7AAGGAAA7AAAAGGCA7AA7 
77AAAAA7AAAAC7AGGCAGG777AAAAAAA7GAAG7AA7C7ATAAG7AA 
AAAAG7A7AA77G77GAAA7ACA7A7C77AG7GGA7GGG7TAAA7AGC7G 
AAGAAA7GA77AA7GAAC7GGAAGG7AG77C7GAGGAAA7CAGAA77CAG 
CA7AGA7AGAAAAAA7GGGAA777ACAAAAG7ACACAGGAA77A7AAAAG 

AGG77AAA77A7AGGGAGGG7AGAA7GAGAAT7AACA77GGTCTAACTGG

AATTTTGGAAGAAGAGAATAGAGAGAATGAACAAGGCAATATTTAAAGAG 
G7GGC7GAGAA77TTTCAGAACCAACACAAAC7A7GAC7TTACCAG7AGA 
GAAAACAA7G7ACAC7GAGGAGGA7AAA7AAA7A7AC7A7GAACAAA77G 
TAA7AATAA7AC7CAACAAAGACAAAGAGAAGA7GTTAAAA7CAGCAAAA
AAAGAAAG7CAGACT7AGAAAGAAA7GACAA7GGCAGAC7AC7CAACAAC 
AACAA7GGAATCCAAATTCGGTCAAACAGTATTTTCTTCATGCTAGCATA 
7AGC 

>Contig40 

GGGAG7CCGC7A7GC7CC7AAAGA777GCACC7C7GATC7GG777G7AG7 
7AGTC7C7777A77GC777ATCC7AC7CAAC7AA7777777AGTGCC7G7 
7777777777777AA7G7G7G77GA7GAC7ACAA77C7AAAC7CA77C7A 
C7GA77CA7GGG7GC777AAAA7C7GAGCAG7C777CGCA777AC7GCC7 
G7GA7GGCCCA7CCCACCAGC7AAAG7G7G7GGCCAC7GC77ACAGCACC 
A7G7GA7AACGAG7AAGGGAGAGA7GCCGCCCAGAC7C77C7AGGAGCAG 
CCAG7AGGACC77CCAGGGGT7GCAAGCAAACCACAGCAA7A7G7GGAG7 
G7GGCAGAGGA7GGCCCCAAGAGGA7G7GGCAGCGGC7AG7GCAGC7CAG 
C77AG7C7SAGAGGAAA7GC7GGAGAGGAGAGCCCAG7C7G7ACAGGCA7 
GACAGCCACAAGGACTTCAACAGCTAACA7GGCTGAG7GGAC7T7ATG7G 
C7A7C7CA77CAGAAAACAGGAGCAA7CAGAAAGGAG7CACC7CC7A777 
G7ACCCCAGGAA77GC7AACC7AC77GCA7C7GAA7GA7G7CCA7CAC77 
CCC77CA7CACC7CC7C7GGGGGC7C7GCAAGGA777GAC7CC7GCA7TA 
GTGATCTGTCTCACCTACGTTGTGATTCACATGAACTTACTAATGTGCTA 
7G7GACAAC7ACCA7C77AAACACAAAAACCC7C7777GA77C7G7GGC7 
CCC7CCAGC7ACCCC7GCA7T7CTC7G7CCCCC7GCCCCG7C7C7GCAC7 
CACT777A7777ACAGCAAAAC7AC7CAAGGGAG7C7CAGTGC7CC77GG 
CTCCATGTC7CCACCTTTCATTCTCTCCTCAGTTCACTCCTGTCAGGCTT 
CCG7CC7CAAGC7C77C77CAC7777G77C7AGGGCCGC7GACA7CC7C7 
77C77GCCAAA77CAG7GGCCAGG7CC7CAC77AC7CAAC7GC7CAGCA7 
7G77GGGCC7GG7GGACCACA77C7CC77CACCCACC7777GC7GC7C7C 
7C77C7C7CCAGA7G777C7C7C77C7CAC7GGC7AC7CC7C7777G7C7 
CC777G77AGC7CCA777C77CC77CCAACC7CAC7G7GC7GG7G7GCCC 
AG7GC7CAG77777AGC7A77C7C7C7777CCAG7GGCA77CA77AGA7G 
G7A7CA7G7GACCCA7GGCA77A7A7GCC77C7ACA7GACAG77AC7CC7 
GAA7A7GAA7C7CAGGAAAGA777GGA777A77777AA77AA7777777A 
AA7777A7777AA7AAA7GAGG7C7C7C7C7G7CA7CCAGGC7GGAG7G7 
AGTA77GAG7GA7G7GA77A7AGC7CAC7GCAGCC77GAACCA7GGGC7C 
AAG7GA7CC7CC7GCC7CAGC77CC7GAG7AGC7GGGAC7ACAGGCA7G7 
GCCACCA7GCCTGGATGAC7T7TTG7G7G7GTG7G7G7GTGTGGAGACAG 
GG7CT73C7C7ATTGCCCAGGCTGATCACAAAC7CC7GGCCTCAAGTGA7
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CC7C7CACC7CAGCC7L .CAAAGTGCTGGGA7TACAGGTGTGAGAC£A. J ; 5 

C7GGGC7AAGA77CAGA7777G7A77CAA77GAC7G777GACA7C77CAC 

77GGACACC7AAGAGG7A7C7CAAA7A77AA77AAC77GGCCAAAA7ACA 

GAACTTTTGACCCCZTGCCCCCACAATACTTGCCCCTTCCCCAGACTTCTC 

CATTTCTG77AAATATCCCCAGTTACTCAACCCTCAAACCTATGAATGCC 

C777GA777C777CT77CCC7CA7C7CC7ACG77GACGCCA7CAGC7AG7 

777G77GCC777ATGCCCAGAA7A7AA7CC7CACCACC77C7C7CC7A77 

GCCCGAG7A7AAGA7G7CAG77777CC7GCACAG7CCA77GCCC7GACC7 

CC7GAG7GG777GC77CCAC7777GACA777G7A77CC7C777CCCCCAG 

GG7CAA77777CACAGCAAGAG7GGCA7777777777777777777777G 

AGACGGAG7C7CGC7C7G7CGCCCAGGCCGGAC7GCGGAC7GCAG7GGCG 

CAA7C7CGGC7CAC7GCAAGC7CCGCC7CCCGGG77CACGCCA77C7CC7 

GCC7CAGCC7CCCGAG7AGC7GGGAA7ACAGGCGCCCGCCACCGCGCCCG 

GC7AA777777G7A77777AG7AGAGACGGGG777CACC77G77AGCCAG 

GA7GG7C7CGA7C7CC7GACC7CA7GA7CCACCCGCC7CGGCC7CCCAAA 

G7GC7GGGA77ACAGGCG7GAGCCACCGCGCCCGGCCAAGAG7GGCA777 

77AAAACCA7A7AT7AGA7CA77GC7777G7G777GGGAACC7CCAAGGG 

C777GCA7CA7A7A7CAAG77GACACC7C7CC7ACCCAAGCC7GGC7C77 

7CC7GC7CC7C7GTCC7C7CAGCCCC7CCACCCA77G77CA7GC7GC77C 

AGCCACAC7GGCCTTC77GCCA7GCCACA777G7GC7AAGCCCACA7CCA 

A7C7CGGGGCC77TGCAC7CGCA777CC7C7GC77GGCA7GC7G7ACCCC 

AGA7C777CA7GA77GGCAGC77C7G7ACA77CAGCCACC7GC7CAAGCC 

ACCC777CAGAGGGCC77CCC7GGCCACC7CACC7GAAA7AGCACC7CCG 

A77GCACCCA7CCGG77A77C7CCA7CC7G77C7C77GC77GG7GA7777 

CCA7CAC7GA7GAGGAAA7GAACCA7GGAA7GC7AGGGC7GA7GACCAGA 

AC777CCCCCACCCCCACA77A77ACAGAGGAGGAAA7GAGG7CGGAGG7 

AAGA7GGGCCCAGGA777C7AC7CCCGCC7GGAC7GCAGGCACAGCAC7G 

ACC7CAGC7G7GC7CAC7C77GGCA77CACCCAACCC77C7A7C7CCAAC 

7GCCCCA777ACCAGAAAG7GAAA7G77C7CAGAGACGG7GAGCCACC7G 

AC77GGACAGCAGCCCAGGGCCCC7GGCACCC7GC777C77CC7CCC7GC 

CA7CC777CC7C7CCAAGACC7ACC777CCC7G7GA77C77GCCCACA7G 

C7GCA777CA7GG7T77A7GACC7GA777C7GAGAGGGA777GAA7777C 

A7GA77A777A7GTAAGCAAA7CA77A7GC77A7ACAAA7GAGAAAAGGA 

G7GC77C7GGAC77CCCAGGGACAAAA7C77G7CAC77GGC77GC777CA 

7A77GC7AA77AAGGACCCAGGA7G7GGG7GAGA7G7GC7AAAAGC7GAG 

AGGAGGC7C7GGAC7C7GAC7A7GGGCCCACACCCC7GGGCAGGCA7CAC 

AC7AG7CC777AGG7CA7CC7CAACCCAGC77CCAG77GAA7CAGA7G77 

7G7GAA7AAC7CAGCAAGGC7G7A7GGGAAA7GAAGAA7GAGG7GGGGAA 

GAGGCC7G7GCAGAAGACACAC7GAC77ACCCC7C7ACC7C7AAC7AGGG 

7G77G7AGCAGCCACCCACCCACCAAG7C7G7C77CCAGACCACG7A7GC 

777CC7CCACC77TGCA7C7777A7C77C7GCCAGCCCAGA7GC77GC7G 

AC7CCAGCCCAAGCC7A7AGGA7AAGC7ACAGCC7G7CCC7ACAGAC7AC 

GCA77GCAGAA7C7AAGACA7CAAG7CAAG77CGGAAGCAC77GCC77C7 

CC7C7CCAGG7ACACAGGC7C7CC7GGAAAGC7GG7AGCAGC7G7GGAGG 

7G7GG7G7G77ACCTGC7GCAGG7GCAGAGAAG77GAC77CACAGCCC77 

CAGAAAGAC7GCCT7C77CCAG77G7A777G7G7AC77GC77GGG7G7GG 

GGAGGA77C7CAGCT77C7CCAC7CAAA77A7CAGACCC777CCA777AG 

7GG7AGACCA777CCC7CG7CCAGGCCAAGGGCACA7AG7ACAGAGAAA7 

AGGGAG77G77ACCCAGGGAGAGAAC77GGC7C7AAACC7G7AA7AGAAA 

GG7CAG77C7GG7CTGGAGGG7CAA7777GA7C777GGC7CAGA7CCAGG 

AATTGGAACCAAGGCTTTTGAACATTTTAATGCAGGGGATTAAAAAAATG 

A7ACGAG7CA77CACGAA7A7A777GC77AACA7C7AAAGAGA7CCC7CA 

AAACAC7AGAAAAAA7AAGAACAAAAA7C7AA7AAAACAAAA777G77AA 

ACACA777ACCAAA777777T7777GG7AAAAA77CAAA7G7CA7AAA7A 

AAGC7AAAG77CC7C77GA7GAC7CGC7CC7C7GCCC7A77CCAC7CCAA 

G7AACCAC7A77A7CAG7C77GCCAA7ACCC77CCAGACC7C7C7ACC7C 

7A7A7ACCA77AGAAGCACA7GG7777GCA77GAGGA7G7GCAG7G7777 

G7777ACG7AAA7GT7A7CAC7C7G77C77G77CCA7AA777GCC77777 

C7C7CAA7GA777GC77GGC7A7C777C7A777CAG7AGCA7C7CC777C 

77777AAC77ACCAT7G777A777AACC77GCC7C7A7CAACAGA7A7G7 
AGGTTGi - -CTAGTTGA*TTCATT^GTATTTATAAACAACGibATCAGli
GA7G7CCA7AAA77TC777ACGGAAGA7GGCAAGTAG7GGAAT7GC7GAG 
CCAAAGAACATGTTTAAAAAACCCAAAAAAACTAGACGCTACCAATT7TC 
TCTCCAAAATGGCCA7ACCCACTTACCCATACAGAGATGATTTGGAATCT 
GGC7TCC7CACAAGG7GAGA7GCCTTCACAG777CA77C77CC7GGCA7G 
TC77CCZ7777G7A7C7GAGAGAGC7GGCAGAA77G7G7CAC7AAA7CAA 
G9A7AGAGGG7CAAA7GACAGC7CAAGC7CACAGGCACC7C7GC7T7C7 W 
CCCAGACCACC7GC777CC7GCCACCAGC7C7G77CCA7CT7A7AGAA7G 
G77GCCAC77GGG7G7C7GC7CCGACAGCCA7G7CA7CC777GCAC7GCA 
G77ATGAAGCAGACAGAGC7AGGAGAGGGGC777GCCAGCC7C7GCCC7A 
GC7TGGAGAA77TCAAAGAAGGAGGG7A77GAGAG7GAGC7GCCGAAGAC 
TGGCAGC7CCC7CAAC7CAACAG77G7CC77CCACAAGAAG7CAGA7ACA 
T777TTTGGGA7AAAA7A777AAAAA77A77A7777A777C7GAA7AA7A 
7A77TACA7GA77CAAAAA7CAAAC7G7AGGCCAGGCA7GGC7GC77A7G 
CC7G7AA7CC7AGCAA777AGGAGGCCGAGGCGGGAGGA7CAC77CAGCC 
CAGGAG7TCAAGACCAGCC7GGG7AACA7AG7GAGACCC7G7A7C7ACAA 
AAA77TAAAAACAAAAA77AG77GGGCA7GG7GGC7GA7A7GG777GGC7 
C7G7GACCCAAC7CAAACC7CA7G77GAA7777AA7CC7CAA7G77GAGG 
GAGGGTCC7GG7GGGAGG7GA77GGA7CA7GGGGG7GGG77C7CCC7TGC 
TG77CTCA7GA7AG7GAG7GAG77C7CACAAGACC7GG77A777QAAAG7 
G7G7AGCACC7CCCCCT7^C7CTC7CAC7C7CC7GC7CCGCCATAG7AA 
GA7G7G7G7G777CCCC777GCC77CCGCCA7GA77G7AAG777CC7GAA 
3CCTCCCAGC7A7GC77CC7G7ACAGCC7G7AGAAC7G7GAA7CAG77AG 
ACCTCTT7TCT7CA7AAA77ACCCAG7C7CAGG7CA77C777A7AGCAG7 
G7GAGAG7GGATGAA7A7AG7GCCA7A7G777G7A77CCCAGC7ACCCAG 
GAGGC7GAGG7AAGAGGA77GC77GAGCC7GGGAG777AAGGC7GCAG7G 
AGCCA7GAC7G7ACCAC7GC7C7CCAGCC7GGG7GACAGCGAGACC7TG7 
C7CCAAAAAAAAAAAACCCAAAC7G7G7AAAA7G7G77CA7AAAAG7G7C 
77GC7CCCACACC7G7CCC7A7A7A7C77A77CC7CAGCC7CCGACAAC7 
AC777A77CA777C77A7G7A7C77CCAGAA7C^AAAAAAAAAA7CAAA 
7ACAAGCACAG7GGAA7G7A77GCCC77C77CCCC7CCC7777G77ACA7 
CAGAG77AGCA7A7CA7AAA7ACGG7C7GCA7777C77C77777CAGC7A 
7CAGCA7G7777GGAGAGGA777CA7A77CG7GCAGACAGCA7G7A77AG 
7CAG7CCTTGCA77GC7A7AAGGAAA7ACC7GAGAC7GCA7AA777A7AA 
AGAAAAGAGG777AA77GGC7CACAGC77CGCAGGC7G77CCACAGGAAG 
CA7GGCAGCA7C7GC77C7GGGGAGGCC77AGGAAGC7777AC7CA7GCA 
GAAGACAAAGCGGGAG7GGA7G7C77ATA7GGCAGGAGCAGGAC7GAGAG 
AGAGAGAGAGAGAGAGAAAGGA7GCCACA7AC7777AAACAACCAGA7C7 
737GGGAAC7C7G7CACGAGAACAGCACCAAAGGGA7AG7GC7AAACCA7 
TCATAAGAACTCCACCCCCA7GA7CCAA7CACCCCACACCAGGCCCCACC 
TCCAACA7CGGGGA77ACAA777GACA7GAGA777GGGC7GGGACACAGA 
ACCAAACAA7ACCAGAG7GC777C7CA77C7777C7A7AGC7GCC7AG7A 
77CTA7G7CC77TACTTCATTTAGGCAG7CTC7TGT7GATAGACACTTGG 
G77AC77CCAA77777CC7A77ACAAA7GA7G7GCAA7GAA7AA7777GA 
TCA7777CCA777CACA7GGG77A7G7CCA7C7G7GGGA7AAA7C7CCAG 
GAG7GAAA77GC7GGA7CAAAGGGGAAG7GCAC77G7GA7777CA7AG77 
AGCAAA7777G77C7A7AAGGG7CA7A7GAA777A7AG7CCCACGCG7AA 
TA777AACAG7GGGGA777CCCGACAG777GACCAACAAGG7C7G77G7T 
AAACTTT7GA77777G7CAA7CTGA7GGGAAAA7AC7AG7A7C7CAAAG7 
GC7777AA777GAC777C77A77ACAA7G77AAGCA7CA7777AC7C7GC 
CCAAGA7CAAA7AG7A7777C777TC7G7GAACAGAC7G77AAGA7CCC7 
TGCC7CT7G77T7GC7GGA77777G77C77777777CAAA7G7777GAGG 
CAG77C7T7ACATG7GAAACAAG77A7C7C777A7C7GGGG7G7GAG77A 
CAAC7AC7777CC7C7GGC77G7777GCGC777GAC777GC77C7GG7GA 
T7CCCGCAATTCTGAAAG7GTACTT77TGCATCA77CA77C7TA7ACA.ee 
CA7GC7C77G77CACGC7GG77CC7C7ACC7GAGGGC77777C7777C7G 
C77C7A7C7GGGAACA7777777GAGAGAGAG7C7CAC7C7C7CGCCCAG 
GC7GGAG7AG7GCAA7GGCGCGA7C77AGC7CAC7GCAACCTCCACC7CC 
7GGG7TCAAGCAAT7C7CCTGCC7CAGCCTCCCAAGTAGCTGGGATTACA 
GGAGCGCACCACCAAGCCCAGC7AATT7GTTGATTTATTTATT7AT7TTT 
GGCTCAAGCGATCCACCCACCTCGGCCACCCAAAGTGCTGGGATTACAGG 

C3TAAGCCACCATGCCCAGCCCATGTGTGGAAATCTTCTGTTTATCCCTT 

7AGGC77GA77C77A7G7CG77C7CC7CCC7CC77CC7GGA7AC7CC7C7 

T377C777A7C77AC7C7AC77G7CA7G77ACC77G777C7GC77A7AAC 

TAGC7GCCTCTCCTATCTGAGGAGGGACT7G7GACTGTTCTCATCTC7GT 

AC7CCCAGC7CC7AG7ACA7AGCGC77GC7CAACAGA7G777GG7GCA77 

GA7AGA7AAA7CAC7GG7AGC7G77AC7ACCAG7CC7GAC7CCCTGCAG7 

GC77CAGC7GA7CC7G77CCAGA7G7GCAC7GAA7A7CC77C7G77GAAC 

AACAGAAA7AAAGGGGA7GGG7GAGGAGGA7AG7C77CGG7GGCCAAGGA 

7A7TTTTAGG7AC777GCAGCAC7CAGCAA7GAGGAG7GGGC777AG7CC 

CCCAAGAAC7C7CACAGCCC7GGG7G7C777AC7G77CAG7G7CAAATCC 

AAGACAAG7CAA7GA7CAGGAAAGACCA777777777G77CAG7GAAG77 

7A777CAGAA7CA77GAACAG7A7GA7A777GG7AA777CA7AAA7A77C 

CCAC77AAAA7GA7CGGAGCAGA7A7A7777CAG7CG7AA77AAAGGACA 

7GA777AAAGAGAGCACACCAG7CCAAA77GAAA7GA77CCA7AGC7A77 

AAAAAAC7AGGG777777ACAGACAA7GA7AC77777GCCCCC777GAA7 

AGA77AGACCAA7GAA7AAAACAAACAAACAAA7AAA7AAA7AAA7AGGG 

AAGCGG77GC7CA7CAGAA7G7GGGAGCGAA7GACAGAGGG777C77AGA 

ACCAAA7G7GGCCG7GG777C7G7CAGGCG7GC777AAG7GAG7AGGAGA 

GG7GAGAGAGGCC7GGC7CAACAAAAGGGC7GGGGA77G7CCC7GAAGAA 

CCAGAGC7GAN77NCA7CAGGAG7AACANAGG7AGA7AG 

>Contig41 

CCGCG77GAGG77CCACGCAG77CAAA77A7G7CCAA77ATCAACA77AA 
TGCACA7777CAA7AGAACC7G77CCGGC7777C77AGGAGGGGGGCGGG 
GAGACG77G77C7C7GGGAA7AAG7G7ACGCAGGAGGC7GAGAAGGC77C 
A77CCA7AGCA77CAC77ACC7CCAGC7G7AGAG7GGGC77A7CA7C777 




G7A7777G7CA77AC777C77C7CC77G7ACAAAGGACA7GGAGAACACC 
AC7GAAGAAAGAAGGGGG7C77G7GG77AGGGACACAGCAG7GCAGGG7C 
ACCCCAACCCC7AGGCCCCA7GAG7AGGA7ACA7G7AA777GG7AGCC7C 
7G7GGGAACCCACAG7GAGG77CC77GGCC7AAGACACAGGA7AAC77GA 
C77C7CACAGACAA7AGCAGGG7CA7777G77GA777AGGG777CCCC7C 
AAAGGC37GAGGG777C7CAGAGCC7CA7AGCAG7AGGAACGGAGAA7GA 
AAGAGGG7C7ACA7777AAA7GC7GAAGGAAGGAAGGAAGGAAGCCA77G 
7G7CAC7GGC7GGCAA7G7GCCCA7CCACAGGAGCGGAACAAC77GA7CA 
A7G7GGAAGGAAAGGAAAGAGG7GAGGC7G7AC77C7GCCAGAAA7CAGG 
CACCAGAAC7G777CAGGAACAGAGAG7AGCCCA7GGGAAGAAAC7GGGA 
3AGGAGAGGC7GAGC7GGGAAAG7GGC7CCAAAGAGAGACAC7CA7777G 
A7C773C7CAG7CACAGCAG7G7CAA77GGAGGCCC7GGGA7CAC7C77A 
C7ACCCGA77CCAAAGAAACAGGA7777C77GGCC7GGC7GAGAGCAAA7 
AGC77CCCCC7GAG7GAGGC7G7CC77CAAAG7CAGCAGCC77AG77GCC 
CACAC7CC7G7GCAGAGGC777GGC7AC7G7GGCACGA7GCCAGGCAGA7 
CACCACAGCTAA7GA7GGG77CACCGCAC77GAAAC7777GCCCG77ACA 
GCGGAGAGA7A7AAG77CC7GC7GGGCGG7AAAA777CCC7ACAAGGAAC 
CACC7GGCA77GGG7GGGACGGA7G77GGGGCAAGGGGGGAAGAC7GGGG 
AGGGGGA7GGACACA77A7CGC7CCAGCAC7C77G777CAGCC7CAACAA 
CAGGAAGAGAGAACCCACAGGCAG77AGGCCA7G7CCA7CAAA7GACCCC 
A7A77G7GGAAGAA77GACA77GCAC7A7GCCCAAGAGAC77GGG7GGAC 
A7GG7CC7GGGAG7GC77GAGCCG7C7AA777C7CAGGG7CACAC7CC7G 
77AACAAA7GCAC7GGCCAG7GCAA7CAAA7G7GCCA777C7AGGACCAA 
AG777G7A7A77CC77777AA7A77777777CAC77G7G77GA7CA777G 
CC77AAA77AAC777C7AC777G777AAAACA7GGAGAA77AGCAAGC7G 
CCAGGAGGCCAGGCAGGGAAACCAGGA7G777CCA777ACC77G77GC7C 
CA7A7CC7G7CCC7GGAGG7GGAGAGC777CAG77CA7A7GGACCAGACA 
7CACCAAGC777777GC7G7GAG7CCCGGAGCG7GCAG77CAG7GA7CG7 
ACAGGTGCA7CG7GCACA7AAGC77CG77A7CCCA7G7G7CGAAGAAGA7 
AGG77Z7GAAA7G7GGAGCACA7G77G777AGG7A7AAAA7CAGAAGGGC 
AGGCC7CG7GAGGCGAGG7GGCAAAA777GA777CT7GGAGGACACC7GA 
3CA7A7ACGG7CAAAG7C7GA7GACAACACCAG7AGGGA7GAAGC7GGGA 
3TGGGGTGGCTAAGAAL. .CTGGACCTGACACTATTAGACATGGG^GC.
C77CAGG7C7A77AC7GC7CAC7G7GGCCGAGCAACAGAGC7AC77AGG7 
AAAA7GG7GA7GG7CATAACAC7AGCCCACAGGGAGG77ACGAACC7C7G 
G7GACAA7G7AAGTGAAAGGCCCC7GAGAAAGAG7GAGGGAG77GCAAA7 
GTCAGTAGCCATCAAGATCTTCTTTAAGAATAGTTTCCACTAAAGAGATG 
A77GC777GG777CCAGCC77C7TTGT777G7C7CCCCGC7GGGCC77C7 
ACC777AAAGGGC7T7GGC7C7GGGGGAA77GAG77GGC7GGGGC77GAT 
GAC77CCAAGAGGACACAAG7GGAGA7C7AC7GCC7GC7C77GGC7AAC7 
ACC77C77CAAAGA7GAAGGGAAAGAAGG7GC7CAGG7CA77C7CC7GGA 
AGG7C7G7GGGCAGGGAACCAGCA7C77CC7CAGC77G7CCA7GGCCACA 
ACAAC7GACGCGGCC7GCC7GAAGCCC77GC7G7AG7GG7GG7CGGAGA7 
TCG7AGC7GGA7GCCGCCA7CCAGAGGGCAGAGG7CCAGG7CC7GGAAGG 
AGCAC7GCGGAGAGAGCGAGGGAGGGAGCC7GG7GAGG7GG7CC7GCCAG 
GAAC CAT G C777GACA7CAGAGAG7AGAAAGC7 CAGAGAGGAGGAAAGGG 
CTTGAAAGAATCCCGAGC77CTAAAGATCA7CCC7C7C7GGGCCAGGCG7 
GG7GGC7CA7GCCTGTAATCCCAGCACTTTGGGAAGCCGAGGTGGATGAA 
TCAT77AGGTCAGGAC77CAAAACCAGCC7GGCCAACA7GGCGAAACCCC 
T7C7C7AC7AAAAATACAAAAA77AGC7GGG7G7GG7GGGG7GCACC7G7 
AATCC7AGC7A77CAGGAGAC7GAGGAAGGAGAATCGC77GAAC7CAGGA 
GG7GGAGGATGCAGTAAGCCAAGAT7GTACCAC7GCAC7CCAGCC7GGGC 
AACAGAG7GAGAC7C7GTC7CATAAAACAAAACAAAACAAAACAAAACAA 
AATAAAA7AAAATAAAATAAAAAGAT7A7CCC7C7C7GAAGC7CAAGGAG 
GTTAAGGG7G7ACTCAAGGGCACACAGCAGG77AGAGGCAGACTCAAGAT 
7AGAA7G7GGGC777C7GACACC77ACAGGC7A77C7777AGAATAAATC 
CCA777C7AC777G77CA7C777777GTACATGCCCCACC7ACACCA7AC 
A7G7A7ACC77C7C7A7A7C77777GTA7CCC7AA7GCTG7CACAC7A7G 
A777GC77777CA7GCAGA7GACCATAACA7777CCA7TCACC7A7GC7C 
AC7CAGCAAG7A77CAA777T7C7ACAC7G77C7T777TT7CC77777CA 
7AACACTG7C7CA7AGGCAT7C7GCAAATCC7G7GAGAG7AC7777TG7G 
AAATG77ACCAC77TCC7C7TAT7CAGAGAAGCTCCG7A7TAAGGCTTCA 
CTGAGG77GCC77AAGGCA7GATAA7GG77CAAAGGC77GAAAGACAG77 
AAAGAGACC7G7AAG7GCACAAAAGAAAG7TGAGCAGGAGAGAA7TTCC7 
GCCTGGAGCAGAGCCAAGC7GCTGGAAGAGGCAA7GGGGGCAAAGGCCAG 
GCAGACAAGCCAATGGGCTCCTCCCACAGC7GCAGCCAACAAG7TATGCC 
AG7C77AAAAC77C7AAAGAAATA7GT7777AACAAGA7TGAGGAC7GGA 
77ATGAGGC7AGGGGAGGC7A7CACAAAC7GGAA7AAAA7AAAGCCAGAG 
AAAAG7GGC7GCCTTCCAACC7GCACAAC7GACC7AGC7AGGC7GATGGC 
7GGGCCACC7AGGAAGGC7AC7GAGCA7CA7A7AAAACAGAAGGGACAGC 
AGGAA7A7AACATGGCTCTTTGTAAGGATGAG7C7GAAAAATGACCATTT 
GC7GCCCAAA7GCCCTTAGCTACAACTGAAAATA7TTCAGAACTGGAGGT 
7GCAGGATGC7GGAA7C7CAGAGATCATCCAGC7CAGCCC7TTAT7T77C 
AGA7GAGG7CCAAAGCGGG7AAAATGAC77G7CAAGG7CAAACAGCAAGT 
GAA7GG7777C777CAAGTC7CAAT7CA7C7777TG7T7ATATCATC7A7 
G7C77G77G77A7AAGCT7CACCCCAGGTAGCAAAAAAC7AT7C7AC7CA 
AAAGGGG7AGACA7A7G77AGT7C7CAAGA7CA7C7C77GGT77CAGAG7 
T7AAC7CAAGTGA77GGCA7AGGC7GAA7CCA7C7C77AAAAGGA7AA7C 
AAA777A7G77GAAGAC77GGT7GTCTTCC7AC7A7GAAA7GGGAAACA7 
7ATCAC7ACTCC7CCCC7G7CACCACCAAG7G7GGCCACCACCACCAACG 
77AG7GAG7GAC7G7GG7GA7ATGA7GACCAAG7GGCCAGGTCAGCAAG7 
GGTGCAGCC7G7G7C7CACTGGAAGAGG77AAAGTC777C7AAAACAAAA 
7ACCATGGCATCAAAGTGGCCCAGAAC7CCC77CTT7GAGCTTTCCC7G7 
G7TAGAGCCC77CC77GGG77GGGAG77AAACCCA7AG7C77ACC77CA7 
C7G777AGGGCCA7CAGC7TCAAAGAACAAGTCA7CC7CAT7GCCAC7G7 
AATAAAAACAGGGACA7G7C7CAA77A7G7C77C7AAACAGG777A7777 
TCC77CCC7G7G7ACAAGAC7TGACTG77CATAAGAAAC7GCAAACAGCC 
7GCC7C7CAAAGCTGCCTGAAACACC7GGCAAG777CACAG7GATA7GCG 
CAGAACAG7CCAGAAGGCAGA77C7AGGCCTGGCAGG7GGGCACCC7GGG 
7GC7CCC7G77GGA7C77GAGGCC7AACC7C7AGCCCAGCAGAG7CAGC7 
AAAA7C7GAGCTCTCCCTCTCCCTCCAAGCCACAC77TGCAAAGGGATTC 
C77G7A77G7GGGC77GGAATC7777C7CCCCA777GCC7C7GCAGGAAG 
C C 2 TT SCAAC AACACA TGGATAGCCT C CAGGTC C CAAGGCTGGAGC
CTTGTAATGGGAAAGTAGTCTTTAAATCAGATTTACTTGGCACCCTGTTT 
GCCACTGAAAGAGGCAATTTAGGGGAAAAATCTGGTCTCCAAGCACAGAT 
AACAC7C7AC7C77GAAAGAGGAGACC7GC7CA7G77AC7GG7C7CAGCG 
7C7CCAC7GACC7G7AA7AAGCCA7CA777CAC7GGCGAGC7CAGG7AC7 
TCTGCCATGGCTGCTTCAGACACCTGTGTAAAAAGGAGAAAATGAGTGAC 
TTCCCCATGACGGC7ACGTTCATGTGTGATTTCTCTCAGCATCGAGTGCA 
TGGCAG7CA7GCAAAGAAA7GA7C7C7GAG7AAA7GAA7GAA7G7G7GAA 
AGAGAAGTCCTTTGGGTCTAGAGAAAAGCATTTGCTAAACCAAACCCCAA 
CTAGCAATGTATTGGCTAGGAGAGCTGGAGCAGAGGCTTTGACACTAACC 
TTTAGGG7GTCAGCTGTTAGATAAGCAGTATCCATTCCCAGAATATTTCC 
ZGAGTCATAAGCATTATATTACACCTGGCATTTTTGCAAAAAGCTGAGAG 
AGGGAGGCAGAGAGGGAAGGAGAGGGAGAGACAGAGAAAGAAAGAGAGAG 
AGAGAGAGAATATGCATACACACAAAGAGGCAGAGAGACAGAGAGACTCC 
CTTAGCACCTAGTTGTAAGGAAGATTAAAGTCATACTTGAGCAATGAAGA 
TTGGCTGAAGAGAATCCCAGAGCAGCCTGTTGTGCCTTGTGCCTCGAAGA 
GGTTTGGTATCTGCCAGTTTCTCCCTCGCTGTTTTTATAGCTTTCAAAAG 
CAGAAGTAGGAGGCTGAGAAATTTCTCTGTTGAATACCTGATTTCACAAT 
CAAGTTAAAGGAAAGGGGAAAAGAGTATTGGTGGAAGCTTCTTAGGGGAG 
GGGACTAATAAACTGAGATAATTCTCTGGTTCATGGAAGGGCAAGGAGTA 
GCAAACTATGACACATTTTGCAAATGTATCACCATGCAAATA^FGCATTGT 
TTTCCTGACAATCGTTGTGCAGTTGATGTCCACATTAAAATACTGGATTT 
TCCCACGTTAGAAGAATGTTTAAATTTAGTATATG7GGGACAAAG7GGAA 
GACACACAGA777A7ACA7GCACA7AC7777C77CA77CAC77C777G7A 
C77AAG777AGGAA7C77CCCAC77ACAGA7GGA7AAA7GGG7ACAA7GA 
AGGGCCAA7AGCCC7CCC7G7C7G7A77GAGGG7G7GGG7C7C7ACC77G 
GG7GC7G77C7C7GCC7CGGGAGC7C7C7G7CAA77GCAGGAGCC7C7GA 
GGAGAAAA77GACC777C77GGCTGGGGCAGAGAACA7ACGG7ATGCAGG 
G77CAGGC7CC7GACGGAG77GGGGCAACCC7GGAGA7AAGC7CACACAA 
CCC7GCAAGACCAGG7GC7G77ACCC7AGCCAA7C7CA7GGA7GAACCAG 
A7CAA7GCCAGA7GAGC7C7GCC7AAAA7GA777777GG7GAAC7C7GAA 
AAG7GGAA7A77G777C7G7AAGAA7A7CCA7C7GAGAC7C7A7C7C77G 
G7AA7ACCAAGAG77A7CAG777C7C777AACCGAGACACCAGCAAAG7G 
CC7GC7CCAGGG7AA7GCCCAGGGGAGCCC7CCA777G7AGAA7GAA7GA 
GAG7CCAGG77A7GAACAG7GCC7GGAG7G7AGGAACACCC7CC777GCC 



7C7CAC7C7G7CGCCCAGGC7GGAG7GCAG7GGCACGA7C7CGGCCCCC7 
GCAAG77CCGCC7CCCGGGTTCACACCATTCTCCTGCCTCAGCCTCCCCA 
3CAGC73GGAC7ACAGGCACCTGCCGCCACGCCCGGCTAA77T777GTA7 
T777AG7AGAGACAGGG777CACCA7G77AGCCAGGA7GG7C7CGA7C7C 
C7GACC77G7GA7C7GCCCGCC7CGGCC7CCCAAAG7G77GGGA77ACAG 
GCG7GAGCCACCG7G7CCAGCC7G7AACACT7C77A7AGCAC7GAG77GA 
AACC77GC7CC7CC7GG7TCC7CCAGGAAAC7GAAA7C77777GAGCCAA 
G7C7AGCACAG7GCC7GGCA7G7ACA77CAGG7GG7AGAG777GC7GC77 
GAA7GGG7GAA7GGGAA777GACAGCA777T7A77CAAA77AG7A7G7GC 
CAGG7A7CG7GC7CGC7C7GCA77A7CCAAGGGAG7GAGCC7C7G7GCAA 
G7A777GAGACACGAGGGAAA7AGG77C7AC7G7GGGAAAAAGAGCA777 
CA7GGAC77GC7C7CCAAGCAGCC77C7GAT7777AA777GGC7CCCAG7 
A7C77GA7A7CAGGAG7CAG7CACAAGAAC7CCA7C777AG7AAG77A7A 
7777CCACAGGAAA7C7AAAAGC7G77CAACA7G77AG777CC7G7GAA7 
77GA7AAGCCA7AA7C CA77CC7AACAC7GAGCCC7CC7GAAA777GG7G 
7C7GG7CC7GCAGA7AGC7AAAAGCCC7G7C7GGG7GGCC7AGGGAC7CC 
7C7G7777GCC7CCACAGGA7CCAC777GCAAA77AACCAC7GG77C7CC 
CG77G7AGGAAC7GCCACC77CC7CAGAGCC7G7C777C77CC77CC77C 
C77CC77CC7C77TC77777C777C7C7C7C7C777C777C7777C7777 
C777C777C777C777C777C777C777C777C777C777C777C777C7 
7CC777C777C7C777C7C7C777C7C7C777C7C777C777C777C7C7 
C7CCC7CCC7CCC7C7C7C7C777C777C77777C777C7777C7C7777 
C777C7C7C777C777C7CCC7CCC7C7C7C7C77777C777G7C7C7CC 
CTCCCTTCTCTCTC7CTTTCTCTTTCTCTCTCTCTCTCTCCTAGACAGGA 




TTTTTTGAGACAGAG 
rCTACCTTTATCCCCC.^CTGGAGTGCAGTGGTACAATCATGGAifc^i 

TGCATGATCACAGCAGCCTCAAACCCTTCCTCAGAGTCTTTATGCGGCAA 

CCAGCAGGGTCTGGAGGGTTGGTGGCTCTGTGAACTCTCCTGACAGAACA 

CAGAGA7G7C777GG7C7G77GA7G7GA77ACAAGC7GAACGAAGGAAGA 

TCAAAGCCAGTGACAGGAAGGGAGATATGCAAGGGACCCGAGCATCAGCT 

CTGAGT7AGTCCATTCTGCTTCTGGGACTTGGGATACAGGTCAGAAACCT 

7GAGC77C7AC77C7CCA7C77CCAA77G7AGCA7CCAGGACC7CAGAA7 

CfGCCAGC7AAGAGGAGCCC7AA7GA77G7C7GG7GGGA7A7GG7GGGAC 

CACAGAGA7GAAGACA7GAA7AGC7A777GAA7G7GAACAGCAGACGAAG 

AAA7CAAGGC7AGGAGGGTGGAAG7GAC7CA7CCAA7AGCACAG7G7GGT 

TGAAGCAGCAC7AG7A7CCAGG77GCA7GAGCCCC7GA7GC777CGC7CG 

AGGGAAA7777GGAGCCA7GGGGCAA7GCCCCC7GACG7AACAG7C7CCA 

CAG77C7GCCA7G7C7CA7CC7GGCCC7G7AACC7GGACCCAAA7C7GC7 

ACCA7CCCA7CCATC7CAGGAAG7GAAACC7C77A7G7CAAA7AGG77GT 

GCAACG7A7G7A7CAGA7CC7G7C77CCCAAGGAGACCGC7CAGGCCACA 

GCAC77CC77CCGA7CCCCAA7GAGCAGAAAA7A7C7CGC7A7AAACA7A 

G77GGCAC7AAGGGAGGGAG7GGAAGAG7GA7GA7GA7G7AGA7GG7GA7 

GTAGCCCCAAGGAAGTGGAACAAGCAGAGATGGGGAGCTGGAAATGCCAG 

GA7GC7CCAGC7777GGGGAA77A77CAGC7C77GAG7CAC7AAAGCC77 

7C7CAGC7GCAAG77CC7C777ACCC7G7CAGG7CA77C77CCAAGACAG 

GAGAC7GACA777AT7CAAAGCAGCAAG7GCCC7GA7ACCA7C7TGTG7C 

TAA7CA7GGGC77CGCAGCCAG77A7CAAGG77GA7C7CA7C7CA77GG7 

C77CAA7CA7777GAACAAGAAGACAAGCAAAA7AA7CA7GGG77AG77C 

77A7A77A77G7G7G7ACA7GCAG7GA7G7C7G77C777G7AG7GAGC7G 

77CC77CC77G77CACCC7C77GC77AGAACAGAAC7AAGCAA7C7GCCC 

CCAACA7777CCCCAA777CCCA7C7CA77C77GGCAC7GGC77CC7AAT 

A777G77C77A7GAG7GA7777C77G7A7CA777CCA7GAG7CCC7C7GG 

GA7C77AAAG7A7GAAAAA7G77G7G7G7ACCCACACC7G7C777G7GGA 

7A777C7C7CC7T7CCC77C7GC77C7GGGA77A777GGGAA7GGGCAC7 

ATGA77777A7CA7A7CGC77CCAC77CC777A7GGCA7CA7C7CCAA7G 

GGC77C77C7CCC7C77GGA7CCAGG77C7CAGA77GGGGACA7GCAGAG 

7CCAAGGAACA77CCA77C7CC7CCC7GG7C7AGAACAAGGAGGGC77AG 

ATA7A7GAGCAGG7GGC7GGGGC7GGCGAGC7A7G7AG7C7CCAATGGC7 

777CCC7GA7G7CGGAG77G77A7G7CAG77C7GGGAGACCAATAAGACC 

77G7CC77CC777GGA7CCA7CAGAAAAAGCCCC7GGG7GGG7AAGA7GG 

A7GGCAGGGC7C7CC7AC7C7A7G7C7777C7CACACC7AG7GGG7A7AA 

GAGAGGGGACCACAAACAGAGGGGGC7C7GG7ACCAC77A7CCAGGG7C7 

GGAAACA7777C7G7AAAGGGCCAGA7AA7AAA7G777CAGG7ACAAC7A 

C7CAACC77GCA7CA777CAGAAAAGCAG7CAGA7AA7ACA7AAA7GAA7 

3GG7G7GGC7GGAC77G7CC7GCGG7CCCC7G7C77A7A7CA77G7A77A 

TATCATTTTTTCTTACATACAAATTTAGAAGCAATACTTAAAAAAAAAAA 

GCCG7CC777A77GAGCACC7AC7AAG7GCCAGG7ACC7777TT7CCC7C 

A77A7C77A77AAC7C77CA7AA7AACC777AAAG7AGA7AA7A77GAAC 

CA777GACC7A7GCAGAAAC7GAGG77GAGACAA7AAA77A777AAGACC 

GCACAAACAG7AAA7GC7GGAACTACGAC7CAAA7A7GGG7TAAC7GAAC 

CAAAACCAGA7C777A777C7CAC7777AAT7G77ACA7A7G777A77GC 

C7CA7C7CC7G7CCACA7GG7GCCCA7CGGCAGAC7CC777C7CA77C7C 

AG7GA77GAG7GACA77C7AAAC7ACA77GGCC7GGCAGA77CACC7C7G 

TCCCC7AAA7G777CCACA77G7CC7777AGGA77GAGA7CC7C7C7G77 

CCC77G7C77CCC7CC77TC77C77C7GGCGG7GACG7GC7G7G7GAA77 

7G777C777C7CC7C7CAGGG7AG7AC7GGGAC777CCAAA7CAGGG777 

77AG7GA7C7CTC77CCC7777C7GAG777C77CC77A77CCCA77CAC7 

77C7CA7C7A7AAG7GGCAGC777G77GC7GGAGGA777CC777G7CC77 

77A77C77C777AAGAC7T7G7CA7AAC7G7CAAAAGCAA7CCC77GAAG 

G7A7C7G7CC77GGAA77G7G7GC77A7GA7GC7GAAAAA7AC7C7C77C 

C7AAAGC7A77A7AAA7GC7 

>Contig42 

GGC7AGC7GCAAC7C77GAA7ACAAACACA77CAGACA7GCACACAC777 
CTGGCTCCCAAAAAGAAAAAAAAAAATCAATTTATAATAATTCTGATCCT 
T7GC77A777CCACAAAC7CCA7GAAAA77G7ACA77G7CCAAGCAACA7 
TTCTTAATATTCT CTTi . TCTCTCATATCCATTTTCCTTACTGCTGTC

CACCTATCTCTTCCAAACTCCCTGTTAAAATCCCTGCCCCAGCGAACTTT 

TA77CAA7777G7GGAA7GGAGGC7GCAC7GA777AAA7TAAAAAAAAAA 

AAAAAATCCCTACTCCATGTCCCAGATCCCTAGTTGTTTTTTGTTTTTTG 

TTTTCCTGAGACAGGGTCTTGTGTCTTCCATGC7GGAGTGCAGTGGCATG 

ATCATGGCTCACTGCAGCCTCAACCTGCTGGGC7CAAGTAATTCTCTTGC 

C7CAGCCTCCCCAGTAGC7GGGAG77CAGG7A7G7GC7ACCA7GCC7AGC 

7AA7777777C777TAT777G7AGAGACACGG7C77GCCAGG77GCCCAG 

GC7GG7C7AGAACCCC7GGGCGGACG7GA7CCGCC7GCC7CGGCC7CCCA 

AAG7GC7GGGA77ACAGGCG7GAGCCAC7GC7CCCGGCCT7GGG7GCAAA 

T77GAGC777C7CAC77A77AG7G7AAGACA7ACAGC7AA777C7AAA7C 

77CCAAACC7CAGATT777CA7CCA7GAAG7GAGGA77A77A7AGAGC7C 

AC7AA7AACA7GGC77CAAAAA7A7A7AA7GCCAAAA77GAGA7CAAAA7 

AA7AAA7C7A7A7TACA7GGGAGA7C77AA7G7ACC7C7TA7A77A77GA 

TAGAC7AAGA7GATCAAAAAAA7AGAAAGAGAGCAG7AAGGAGAGCAAGC 

A777AA7CAA7AGGACCAA7ACA7777AA7CAA7AGGA7CCTCAGGAA7A 

7A7 ACAGAA7AC CAAACC7AACAAC7GCAGAAAACA7GCCAAACA777AG 

G7ACAGACA77G7TGGAAAA7GCAA7C77GAAACGAG7GGAC7GACA77C 

AGAAGA7A77AA7AAGAGCAC7AA7GA7GGGGA77GCAACCA7G7C777A 

C7GAC77CCAGAAGCT7CT7ACAG7AAACA7GAAA7CACA7AA777C77C 

CACTTTCCTAGTGTTTCTTGTTCTGGGCTCTGTCCTGCTTACTGTCTAAT 

ATC77GGCCCCTTAAAAGTTGCTAA7CTTCCAAACCTCATTCC7GTGACT 

GGGCCGC7GG7CC7TG77CA7GGGCC77GAAAA7AC7GAC7G7ACAC77A 

7C7GGAGCA7CCAG7GCC7ACCACC7GACCCAGA77CC7CA77GCGC7CC 

7CCC7CC7CCACCTA77GGAA777GC7CA7ACCCG7G7GAGACCCC7CCC 

777CCCCCCA7C7GAA77777A7CAAGACAACGCAC7GCCA7AC7CCC7C 

G7ACCC7GC7C7GGGCA7CAGAC7GAA7G777G777CCA77GAGGA7C7G 

CAGC7GCA7CAG7T7CCCCAGCACCG7CCAACCCC77GAGCA7GGC7AG7 

CC7AAAGCAGAGAAT7AGCC777C7A7CCC7GC7GC7A7ACA7GC7GGGA 

CAAA7AA7AAGAAA7GACAGCA7777A7GA7AA7GCAGGC7GCAGGAGGC 

AGGAGGCAGGAA7CAAA77CG7GC77A7CAAA7AG7GC7CCAA77C7TTG 

AATA77GGAC7A7AGAA7A7G7CA7GGA7C7A7GC7CAGG7GGG77CCC7 

A77AC7CAC7CCAC7GAGGCCAGG77G7GGGA77AGC7G7CCAAGAGGGA 

G7T7CAG7C7CACAGCA7AGGG7CA77C7GAGAA77AC7GGCCCACAC77 

G7G7GGAGACC7CCAGAGAACAGAA7C7GGG77GG7GCCA7G7AC77CCA 

GGAGGAGAGAAG7GGCAGGA7GCCCAGCCCCACAA7CAGAGGGGAAGGGG 

CAGAGCCACA7G7A7GAAGA7CC7C7CCCCAG7ACG7GCCAA7CACAGGG 

C77CC7AGCT777GGGCCAAGGAAACAA7G7GGGAAGCAAAAAAGGACAA 

T7T7C7CC7CCC7T7GCA7GAAGAC7GAGCAG7777ACCAGA77CCCAGG 

GAAACACCCT7CCACTC7GGG77GAA7G7GAG7GAGAGACA77CAGC7GG 

AACAC7AGAAAAAC7ATITCC7GAGCGAC7CACC777AGCCC7AGAAAG7 

G77GGA777G7CCT7CA7C7TTGCCACAG7AGAGAC7GC7GA7AGCA7CA 

GAAC77GGGC7C7GGAA77AGACAGA7A7GGG7ACAAATC7GAGC7C7C7 

CAC77A77AG7G7GGGATG7AGAGCAAC7777AAAA7CC77CCAAACC7C 

AGAC77C7CA7GCATGATG7GAGGA77G7AA7AGGGCCCACC7AA7AGGG 

G77777GAGAA77AAAAAAG77A77CAA7GAACAGCA7T7AGCAAGA7GC 

C7GACCA77GAGAAAA7AACAAA77G777A77A77A77G77A77A77AAA 

CA7C777CC7GCACC77C7GAC7GGGGGCA7CG7A7CA7CAGAAA7AC77 

AGGA7GGGA7GGAT7CC7GCATGGGC7GAG7CAAGGG7GCAA7AA7GGAG 

GAG7GAAGAAGGAAGAAA7GGAGGCAGAAA7CCCCAGGAGCCCAGCA7GG 

TACAAGGC7GAGC7AG7GC7GCAGAGCC7CC77GGAACAGCCACAGAGC7 

7GCA7C7GGCCC7GGGAGGAACC7C77C7AGC7GGCAGGACCAGCCACAA 

CAG7GGCCAGGGGA777CCCAGGGCG7GGGC7CC7AGGAG77CA777GGA 

CCAAGCC7GCC7GGAGAGGGG77A7AACAGGGA7CC7TCCC7AC7GGCAG 

G7GA777ACCCC7CGG7GAGAAGC7CAGGCA777G777GA7GGAAGG7GG 

AAGGCCC7G7GC7GGGCCAG7GAC7A7CAGGGA7GGGCGGG7GGC7GGAA 

AA7AGCAAA7AAGACAA7A7GA7AACACAG77AACCACCACAC7A7G7GA 

AGC7ACAA7A7GGG7ATC7G7AA7AGACAA77CCAA7GTAGAGAA7AA77 

C7AAGG7G7CA77C7CCCCGCCAA7GCCA7AAGCACACGGCC7C7GCC7G 

GG777C7CAC7G7GGAA7G7CC7CC7GG7C7CC7CA7GCCCAGAGAG7GG 
GAAGT ACT Z CT ACTTT, . .CACCGGCTTTCCTGTCATCTCCCTGCAGCC.
CCTCAGCCCCC7C7GCACAGGGAGG777CC7CCC7GC7GC7GCAGTGCTT 
TGTACTTGTTAGTGGTACCTGCACACAGGTATTGGTGTCCTTGTCTCACC 
ACCCTACA7CAC7GTAAGCTCCCCAGGAGCAGGCTTCCTGTTTGACTCAC 
CTG7GA7CC7CCACCTCCCACCC7G7AG7GCC7CAAGCAT7GAGGACAA7 
CAC7GGCTGCCCCTTAACCCAGAAATGCTGCCGAGACAGGAGGCCATGGC 
CCAAG77CC7GGAATGGGG7ATTAC7A7G7CAGCACAAAGGCCT77GCAC 
AAA7GAAGGCTTTAAAAATGCAG7CC7AG7CAGG7GGAGGAGGGCT7A7A 
GGA7TCCCAGGAA7CTGGA7CA77C7C77GAGAGC7T7CCCTTG7C7C7G 
77AAAAC7CACA7CCTACGGCCCAAA7AACAACAAAAAA7GGA7G7AAA7 
TC7TGAAATAACT7G7GGA7GGGGGAACAAGGCCCACCCCCCAGATCTGC 
CAGAAGC77CAGG7GAGGG7CCCAAA7GCCAAAAAG7CTGGTA7CAGAGA 
GGATGGCCAG7GACCTGGGGACACA7GCCC777GC7G7G7CAC7CAAGGA 
GCAGCAGCC7CGGCCCCGCACAGTGACCAGGACCC7GGC7TCCCACGCTG 
GGCAGGAGC7GG7GTC7GA7GAAGGGAA7GCC7GGCAGCACGTGC7G7CT 
G7C7CCTCG7G7CAGC 7TAC C7GGC7T7GC7GCGAAGAGGCCAC7CGCA7 
7TCTGAA777777ATATTT7TTTAA77TT77AAA77TT77A!TTT7ATT77 
TAT7777A777A7T7A7T7A77777AA77TTT7T77AA777777AAA77A 
TGC7T7AAG7777AGGGTACA7G7GCACA77G7GCAGG77AGT7ACATAC 
GCA7ACA7GCGCCATGCTGG7GCGC7GCACCCAC7AAC7CG7CA7C7AGC 
ATTAGGTATATCTCCGAGTGGTATCCCTCCCCCCTCCCCCCACCCCACAA 
CAG7CCCCAGAA7G7GATGTTCCCC7TCC7G7G7CCA7G7GATC7CA77G 
AA77TCT7TAAAGG7GGAA7CTC7CAG7GGGG7CTAA7C7G7TCAGAAA7 
A7CAAAAGAG7A7CCT7GGGAATGAC7GGAA77CCAGAG7CA7C7GG7AA 
7CC7CA7AAAACAAC7CC7GGA7G7C7CTCAGCACA7C7CCCACC77GAA 
CGCAGGAGGCTGG7TCAAA7GGAGGAGCA7CGC7C7AC7GCAC7TT777T 
7TTT777GGCC7AAAG7GCAAAAGGGGA7ACG777CA7G7AAA7AAATCA 
ACTGCAAA7CGC7AGT7A7GCTGAGCCCTGTCCCG7GCTG7GGACACAAA 
GGAACCAAAGGCT7TTCTCCCCGCCCAACACACACATAACACACACACAA 
AATCA7AAAAACA7ACATACCCCCAACACA7AACAACACACAACACACAC 
ACAAAA7A7A7ACACACAACACACACCAAACA7GCCCACAAACC7G7G7C 
CAGAGATAGA7CC7AC7GG7GGGT77GTGGTC7CGC7GAC7TCAAGAA7G 
AAGCCG7GGACC77CGCAG7GAGTG77ACAGC7C77AAAGA7GGCA7GGA 
7CCAAAGAG7GAGCAG7AGCAACGT7TACTGTGAAGAGCAAAAGGACAAA 
GC77CCACAACCCAGAAGGGGACCCCAGCAGGG77GC7GGTTGGGG7GGC 
CAGC7777AC77CCTT7TGGCCCC7CCCA7GT7C7G777CCA7CC7A7CA 
GAG7GCCCTTT777CAA7CC7CCC7G7GA77GGCTACT777AGAA7CC7G 
C7GA77GG7GCA77TTACAGAG7GC7GAT7GG7GCG7777ACAA7CCCC7 
TG7AAGACAGAAAAG77CC7GATTGG7G7G7T77ACAA7CCTC7TG7AAG 
ACAGAAAAG77CCCCAAG7CCCCAC7GGACCCAGGAAG7CCACG7GGCC7 
(2ACC7T7CAAC7CCA7AA7GGCA7GAAAA7ACA7A7G77G7ACAAAACA7 
ACA7ACACAAAG7ATACA7GCATC7CCCCAAA7A7ACACA7ACCACAGAA 
ACA7ACACACAGGAAC7CAGCTACC7GTCAAAAG7CTGCA7GG7GATTGC 
C7C7GCAG7GAGTAGT7AGAAAAGTGAATT7G777TTCAA7AAATTGGAG 
TCC77AAAAA7CG7TG7AAGA7AGAAAATT7T7AAAAG7A7ATAAAA7AA 
AATA7G7A7G7CC7TTGGTC7AGCATTTACACATG7AGGAATT7ATCCTA 
GTGGAG7AA7CAA7GA7ATA7GCAAAGATTTGGACAAGCATATTAAGCAC 
AGAA77A7G7A7GCA7ATG7G7GTG7A7A7A7A7A7A7A7C7CATACATA 
T AAT AA7G7AAAAGTGAAAA7AAC7CAGA7G77 CAAAA77G AGGATT AG7 
7AGAC7A7GA7CTGTCCATA7G7GACA7ACAAGTTAGC7GCCCCTTA77C 
TC7CGAGCT7CAACC7CC7A7AAACAG7G7CCC77G7A7A7CAG7A77GG 
7ACAGA7AA7CGAAC77A77GAGG77TTACA7GGGGCAATAAAGGCAAGA 
GT77A7GAA7AC7CCA7AC7ACAC7AGG7AGCACCCCC7A7TAAAGACAA 
AC7C77C7C7C7CAT77CCC7TCC77TCCGGAACCAC7TGGTTGAA7CTC 
7ACAAG7C7C7A77GCAAC7GCCTCAACA7GGCACCC7CCCTGCATC7CC 
A7CT7CCC7GTCC7GAGAGCAATGGCC7GC7GCCCCCACAC7CACA7CC7 
CA7TCA77CCAGAAG7GAGCACCACAGAAG7GCC7ACAG7TACCCCAACC 
ACC77C77AGAAGA7AAG77AG7G777G7T77GACTT777AAAA7TT77A 
CT7CC7CT777CC7TCACAA7C7CA7CCCA7CCCAAGAGG77TATCAAGA 
AG77C7C7AAAGA7A7G7G7CTCC7TA7GGAA77TAACAGAAA7CAGGGA 
w 1 AUU^ATv vGGGAAT AACATTTTTC CAGGTCTTTAGAC* . 
ATAATGGAATACCTTGCAGTAATTAGATACACTATTGTAGAAAAGTATTG 
ATGAAATGGAACGATGT7TGAGATATCATATTGAGTAGAAAAGGCAAGAT 
ACATTAAGTAGGAAATG7ATCTTACAAAATAATTTGTCAGACACACTCCT 
ATATTTG7ATGTTATATAAATGCGTATGTGAAGAAAGGCTAGAGGATGAG 
ACCACAGTC77CGGTGAAGTTTAAGAGATGATGCTGCAGCATGCTCAGAA 
AGGCT7GGTATAGTTTTTTCCAGTAATTAAGGAC7GATCTTAGGTAAATT 
GTCCA7CC7C7C7AAAG7GCACCACC7TT7G7C7G7AAAACAGGAAGGA7 
GG7A777ACCCCCAGGG7CA7CAAAGGA777GG77GGAGAAAAA7AAA7A 
AA7GGGC7GAGCCCAGACC7GGCACAG7GAGAGCACAG7GG77GACTA77 
GTGC7GGCC7G77G77CC7G7G77A77GACA7GC7GC7GG7GG7GG7CCA 
GAAGC7A77ACC77AA77GG7TATG7GGA777CCCC7CA7AC7GAGCAGC 
7G7G7G7GG7G77G7AAAACA7AGCCA7ACACAG7AAC7GACAAGGGCAA 
ATG7GATGGAAAAA7GCAAGGAAG7GCAGA7AAA7AGC7AA7GGGC7G7A 
GAAGGAAGC7AG7CC77GGAGGGC77GA7CAAGGAAGG7CC7TT7GCA7G 
7CACCT77GAAGAAGAGGGGACA7AGAAGAGG7A7AG7GCA7CCCGGAG7 
GTACC7GGAAGGGAACA7GAAAAGAGGACA77777C7C7GG6ACATGGGG 
AC7CCAC77GCA7GAAC7C7GGAA77GGGGCAAAGAACCA7CA7GAGAAC 
AAGGGC77CCTTGAACCTCCCAGGCTCATTGGCTGATCTAAACCCTGTGT 
CCCC7C777CC77CAC7C7CC7CTG7777C7A7ACC7G7A77A77GGAC7 
GGACTGGAAGCCACCTGATCTATCACAAGTACCTTGAAATGTGTTGAATA 
GG7G7GGCACAG7CC77AGCAGAG7GGCAC7ACCCCCACAGGAA777G77 
7A7ACC777GGCA7GGAAAA7AGCAGGAAA7GAG7GA7CAC7GA7AAC7G 
AGGA7GC7A777A77A77GGCCAAAGGAA7AC77G7G77G7A777GCA7A 
ACCAC7CACAAAC7G77GA77ACAAA7GAG7ACCAGACC7AGC7CC77CA 
AG7AAAGGA7C77GAGAAC7GAAGGCAAACAGAGC7CCAGGAG7CCAAGA 
CAGAGCCACAGACCACGAGGA7CCC7GGCCCAGG7AGG7GG7CC7CC7GC 
AC7GGC777CAAGGCCAACAGGA7GGA7GGGGAAG7AGAG7AGCATC7GG 
CCA7C7AGACCC77GC77777A7CCCCAC7GGAAGCACA7C7GAATT7C7 
AAA7A7GA7C7C7GAGACC7GCCCAGAACACC77GC7CTCAGCCCCAG7A 
GCAGCC7GC7CTC7CCCAGGAGGGC77CCAC7AACAAGTAGGGCATTGC7 
GGAGGG C CAGGCAGACAC7AGC77AGG AAA7 C CAC CAAC CC7GG AAA7GC 
7AG7CCC77C7C7GAAGGC7CAGAAGAC7GAC77TAGAG7C7AGAAAATA 
77GG7CC77GGGAACAGA7777GAG7GCAAAGAGA7GGAC77CAGA7GGC 
CAGA7GCAC7GC7TC777AGGGAA77C7G7GAAAGC7CCC7GCA7TTATC 
77AA7ACAGGCAGCAGA777CA7GAG7ACCCCCGAGGGA7GGCCCCAGG7 
CC7CCAGCC7G7GAGCA7CCT7C7G7CC77CAGCAGCACCACAG7A7C77 
7A7A7G7C777GGA7ACC7ACG777C7GCCAGACA7C7C77GC7C7GA7G 
77C7GGC7SCCAAA77C7C7G7CAAGCGCC7CCAA7T7777G7G7CCT77 
GA777ACCCCAACA7GACAAAGGCAG77G7GC77CATG7A77CAGGGA7A 
C7GCCAAACCACAAACAGG7TAAAA7CAAA7AGCAGATA7CCC7GTTCC7 
AAAGACCCA7CAGC7C7ACCCACC7GC7CC7GC7CACCG7CC77A77G77 
GAG7CC7GAAGCCCT7CCTTGTCA77777A777777GCA7GAACAATTTA 
G77CCC777G7C7CAC7CC7AAACC777C7CAAAGGA77GGA777G7ACA 
CAAAC7GCC7A7C7C7GCAATCT7AGAAG7GA7ATGAT7C7GAACAAA7C 
AC77AAC7777GA777777A77GG7AAGA7GGGAA7ACCAA77777GC7C 
CAC77C7G7CC7ATG77GGCC7GGGC7GA7G77GAAAGC7C7CGG7CAAC 
7GAGA7AGGG7G7GCAGAATT7A7ATA7A7AAA7A7A7C7CC7CCAACCC 
C7CCCAA7GAAGCAAG7CACG7GAG7CAA7CC7ACCC7AAGA7A77AGGG 
A77GAGCC7CC7GGGACA777GG7GGC77AGG7777CA7GAAAAGAGG77 
GCAGAGCAAC7GC77777G77AGGCAAAGA77AGGC7AC7GCAGAGAC7C 
AGCAAAC77C7A7AGAAGG7G7CAGA7GG7AAG7A7777AGGC777GC77 
GCCAGA7GA7C7C7CAAC7AG7TAACCA7GC7A77G7AGCC7CGAAGCAG 
CCAGAGACGA7C7G7AAACAAGAGCA7G7AG7G77GGCA7AAA7A7AG7A 
CCGCG 
>Contig43 

GCAA7AAG7C7A777AC7G7AAAG77AA7CAAA777ACA777CAGAACAC 
77AA7C7GCAAGAG7CC777CCAAGACCC7A7ACC7AA7777G7G777AC 
AA7777A7A777G7777C77AAAGAAGACCACCAA7A7AAAC7A7ATCCA 
GCC77CA7GA7AAG7ACA7AAGAAAC7A7GCAAA7AAGGGGGAAAAAAAA 
CAAAGAAAAATACCTAL . fTACTAATGGTTCACTTCTGAATAGGATLAT

TCATAATGATACAAGCACTCATTACTAGTCTAGGAAAATGAAGATATAAT 

TGCATTAGGAAGATCAAGAGGTAGGAAATGTGGATG7GTGTGGTATAGAC 

TAGGGCAGGACAAAGAACCTAAATCCTCATTTTCTAAAGATAATTGTTAA 

TACGTAAAACTCAAAATTCAAGAAGTAACAGTAAAAGCGGTCATTAAGAA 

ACAAGCACTAAACACCAGATAGGAAGCGAGAGATGGGGGAAGAGGGCAAG 

AATCTGATTATTTTTTGCAACAAATTTTGTAAAACCATTTGACTGTTTAC 

AfGTAGAACTTGGATCTTTTTTAAAAAACACAAAATAATAATACTATTAT 

TTTTTAACTGGATTTTTGAAAAAGAAGATAAAAGTCTCATTTTAGTAATT 

AAAACTCATTCCAGGTTAGTCCACTCAAAACTTATATTCGAAAATTAAAA 

C7TTGGGAGGCTGAGGCAGGCAGATCACCTGAGGTTGGGAGTTCGAGACC 

AGCCTGACCAACACGGAGAAACCCCGTCTCTACTAAAAATACAAAATTAG 

CTGGGCG7TGTGCATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAG 

GAGAATTGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGAGATCACA 

CCATTGCACTCCAGCCTGGGCAACAAGAGTGAAACTCCATCTCAAAAAAA 

AAAAAAAAAAAAAATTAAAACCTCTGGAAGTTGAGTTTGCAGATATTCAT 

TATGCTCATTTTTAACTTGTATGTTTGGAAAATGTCATGATGAGAATTGA 

GGTTGGGGGATGAGAAAAAAAGAAAAACATCAACCCCACAGCCCATTCAA 

TTTTCAGCCCGACCCACAGCTCCGGGGAAGGGCAGCAGGTCCATCCTTCA 

CTCTTTCTTCACCTCTTTCCCCTCCTTCTGGCTCTTCCACCTCTAAGTTG 

GAGCGCAAGAAGAGGCACTGGGAAATGGAAAAGTCTTTTGTACGTGGTAC 

TTGCCGGGGAAGCTGCCATGAAGACCTGGCCCCACGGTGGGGAGGGAATG 

CCCAGCTGAGGCCTCGTGCCCATGCTAGGATAGACTCGTCCAGACATGTC 

AGGTGGTCTGACAGGGCAAGCAGCAGGAAGTCATG7ATGAGTATGAAC7G 

ATCTGTATGCAAGGGCGGGGAGAACACGCGGAGGAATGGGGCGTGAGAAA 

ACAGCACAGTACGTTTCTTTAGCAGCTGTC7CTGCTCAGCCATGGGAGTC 

ACCAGAGAAAGAGGCTTGGAGGCGTTATTTTCACTGTGAGATGTGAGTGT 

AAAAAAGTGCCCAAGACACAGTGAGTACCAGGGAGATGCCCTC7TTCCCT 

ACCCGAATGCAGAATGGCCACAGGCCTTAAAACACACACATGGTTCCTCA 

GAGGAGAGAGGCCTCCACAGTGGACACCCGCATTCTCCCCTGGTCAGCAG 

CAGCAGGGCGAGTGCTGGGCCATCATGAAGCTTCACAGGCAATGAGCTCT 

CAGCAATAACAGGAACAGTGCCTGGGGGACTGTAGCTGCAAGACCGATTT 

TCA7G7AAGA7GGCC7C7GAGGAC7CCGAGA7ACACCAGGC7GAGAC7AG 

CTGGCAGCTCCAAGTTCTTGGTCAGAAGAGAACAGGAACTAGGGAAATTG 

GAATTACTGTTACTACAATTCCTTTACATCCGCACAACCATGAGGTCCAG 

AGAGTC7CTCTTATTTTTTTTTTAAAGACAGGGTCTCACTCTGTCGCCCA 

GCC7AGAG7GCAC7GG7G7GA7CA7GG77CAG7ACAG7C77CACC7CCCA 

GGC7CAAG7GACCC7CC7GCC7CAGCC7C7CAAG7GGC7GGGACAGCAG7 

TGCA7GC7ACCAGGCC7GGC777777777777777777777777777777 

TCGG7AGAGAC7GGG7C7C7C7G7A77GCCCAGGC7AG7C7CGAAC7CC7 

GGGC7CAAG7GA7CC7C7GGCC7CAGCC7CCCAAAG7G77GGAA77ACAG 

GCA7GAGACAC7GCACCCAGCCAG7A7AG7C7777AACAGC777A77GAG 

G7ACGGC7AACA77GAAAAAAC7ACACAAA7G7AAAG7A7GCAA777GA7 

AA7777GACAAA7G7ACACACCAG7GAAAC7A7CAC7ACAG7CAAAA7AA 

7GAACA7A7CCA7CAC7CCCAA777CC7CACGCCCC77GG7AACCCC7C7 

C7CCCAAC7CCC7GCCCCC7AACA7CAGACftAC7AC7GA7GCA77C7G7C 

TCCA7AGGC7CA777ACA7777C7AGAA7777ACA7AAA7AAAA7GACAG 

AG7A7A7AC7CC77CA7G7A7GGC77C777CAGCCCAA77A7G7CAAGA7 

TCA7GC77A7GGC7G7GCG7A7CC77AGCCCA7C7C777G7C77GC7GAG 

7AGGA7ACCA77GCA7AGACAGACCACAGC77GC7CA7CCA77CAC7C77 

GACAACG77GAA77G7C7C7G777777GCAA7GACAAA7AAGG77GC7A7 

G7ACA77CC7G7A7AGACA777G7AAAAGCACAGCA777CA777C7C77G 

GG7AAAGACC7AAAAG7GGAAAGGC7GAG7CA7A7GG7AAA7A7A7A7G7 

C7AAC77777AAGAAAC7G7CAAAC7G77ACCCAAAGGGA77G7ACAA77 

77ACA7CCCCACCAGCAG7G7A7GAAAA77CCCG7AC77CCACA7CC7CA 

CCAA7A7A7GG7G7GG7CAA7C77777AA7777GGACA7GN7AA7GAG7G 

CAAAA7GAGGCCCAGAG7G7C7GAAG77ACA777G7A7CC77777GGCA7 

CCAAAACAGG7G7CAAGCA7AGAAAAAACAC77G77CC77GAA7GG7CAG 

TCA777ACAAG7GGAA77CA77ACAAACCGG7AG77C7AC7GGG77AAAC 

7A7GCC77AC7G7CAACAGGCACA7ACACA7ACAGACAGACAGGAAGGCA 
CAGAGACAAGGCAGAGC . _ TGATAAGAAGGTGACCTGGGCTCTAGCTC1

GCCTA7CACCTAGTAAAATATTAGTTAAG7AGCCATGAGTAACTCACTTA 

ACTTACCACAGGCTCCATTTTCTTATCTGTAAAATAGGAACATTGAAACA 

GCTAATCCCCAAGGTTTGTGGATAATCAGAATTACAAAGATCAATGACAT 

TTCTATGAGAGAAACATATTTCCAAGTATTTGATGGAGTACATCAGACAC 

AAAGGAAAGGAAACTGAATATTTTTGAGGTTTTTTTTTTTTACCAAGAAA 

TTCACATTTTGTTAAATTTTCAGAACTACCTCCTGAGGAAAGTGTAGCTG 

CACCCATTTAGAATGATAGAAAACATCAATCTGTCTGATTCCAAAGCCAA 

GTTCTTGCTACAACGAGAAATGAAACAACTGGATCCCTACAGATGCAGAG 

ACCTGGGCCCCACAAATGTGAATTCTGTTCCCCTACCGAATAGAGTTACA 

GTTCCATAATACAGTACTCCCTCACTTTTCCACAGTCTCACATTCCACAG 

TTTCAGTTACCCACAGTCAACTGCAATCCAAAAATATTAATGAAAAATTC 

CAAAAATAAACAATTCAGAAGTTTTAAATTGTGCTCCATTCTGAGTAGCG 

TGATAAAATCTTGTGCCACCATCCCACCTGTCCAGCTTATCGTTAGTCAT 

TGACATCGTCTGCTCCTGACATCCAACCATTGACATCATCATGACTCTAT 

GATCCAGGATCACCGAAGCAGATGACCCTCCTTCTGACATATCATCAGGC 

CAATATCAGCCTAAACACTGCATCACTATGCCCACATCAGTCACCTCACT 

TCATCTCATCAAGGAGGCAATGGATCACCTCACATCATCACAAGAAGAAG 

AGTGGGTATAGAACAATAAGATAATTTTGGGGCAGGCATGGTGGCTCACG 

CTTGTAATCCCAATACTTTGGGAGGCCAAGGCAGGAGGATCCCTTGGGCC 

CAGGCATTCAAAACCAGCCTGGGAAACATAGTGAGACCTGGTCTGTCTGC 

AAAAAAAAATAAACAAAATTATCCAGATACAGTGGTGCATGCCTGTGGTC 

CCAGCTACTCAGGAGGCTAAAGTGGGAGGATCACTTGGTCCCAGGAGGTC 

3AGGCAGCAGTAAGCTGTGATCGTGCCACTGCACTCCAGCCTGGGCAATA 

AAGTGAGACCCTGTCTCAAAAAAAAAAGGTAATTTTGAGAAAGAGACCAC 

ATTCATACAACTTTTATTATAGTATATTGTTAGAATTGTTCTATTTCATT 

ACTTATTG7TGTTAATTTCTTTCTTTGCCTAATTTTTTTTTTTT7TTTTG 

AGTCGGAGTTTCACTCTTGTTGCCCAGGCTGTAGTGCAATGAGACGATCT 

CAGCTCACCGCAAATCCCGCCTCCCGGGTTCAAGTGATTCTCCTGCCTCA 

GCCTCCCGAGTAGCTGGGATTACAGGCGCCTGCCACCATGCCCAGCTAAT 

7TTG7A7777TAGTAGAGGCGGGG777C7CCATG77GGTCAGGC7GG7C7 

CGAACTCCTGACCTCAGGTGAGGCCTCAGCCTCCTAAAGTGCTGGGATTA 

CAGGCTTGAGCCACTGCGCCTGGCCTCTTTGCCTAATTTATAAATTAAAC 

ATTGTCACAGGCATGTATTAATTTATAGGAAAATCATAGACATATAGAGT 

7GGGTACTATCCACAGTTTCAGGCATTCACTGAGGGGCTTGGAACACGCC 

CTCCTCAGATGAGGGGGGACTACTGTCATCTCCTCAATCATTCTTGATTC 

AATCCTCAACACAAATGGTTTGGCCAGGTCTTGCCTCTGGAGACAAAATT 

GCTAAGGATTTAGAGGGGAAAAAATG7AGTTCACTGGGAAAGTCACCTCT 

3C7CCAC7GGACAGCAAC77AAAACCCAGGCCA7GACAAG7AGAAAGGCC 

ACCCCCAC7C7CC7TCACACC7GGAG7A77CAGGAG7CAA7CA7A777CA 

GGACCACCAGGAGCAAAC7GGGAAAAAC7GAGC7GCC77GAGGAAAGCAA 

7CAGC7 CCACAAGGGGC77AAGAAACAAGC7C7GGGAGGAG7GG77GGAG 

AAGAG77GGGGACACATCAGAAA7GCCA7CAAA777C7AAGGGC7ACC7C 

G7GG7GTCAGACC7G7GCA7C77CAAGGACA7AAACAGA7GGGA7AAGCA 

GA7GAGA77CACAGAGGACA7CAAAA7A7TGGCTCCCCAGAAGGGAGAAC 

A77C7AG7AACAGAGCTGCCCAGC7GCAGAGTGGAC7GTTTCACAAAGCA 

ACAGG7GCCCTGCCTCTTGAATCACCA7CTTCACAGGAATGCAG7AGAAG 

GGAC77AAC7CCTGCCC7GAAGAAAAGG77AGGC7AGGGAAACAGC7CCA 

AAA777777AAAAGGAAGCAACA7AGGCA7CTAC7GGGAG777TC7AAAG 

CC777G777AA7GAAAC7AAAGAGC7GGGACAGGAAA7GCCAAA77AAA7 

7AA7AGAGCC77GCT77AAGACAA7GCAAG7GGA7GG7AA7GAAGGAA7G 

AG7C77AGGCC7TGGATCAACCG7A77AAGCAA7GC7GAGCA7GGAGCCA 

A77C7G77CAC7AGATT7GC7CAGAAAGGGCCAGACGAGAAGGA777T7C 

7AAAGGCACCTACTACCAAAAAGC7GCCAAGGCG7CCAA7GGAGCCCAGA 

GAGAA7A7GC7AACAATAAAAAG77GAACACCC7CAA7AAAAAAGGGTAA 

AAG7AA77AA7AGAAAA77AC7GAAAGC777777GAAACCAAAAG7AG7C 

AGCA77GG7AAAAGTCTACAAAAG7GGACAC777CA7A7AA7G77GGCAG 

GAGGG7AAAAAGACA7AACC777TTGGAGGACAA777GGCAACAGAG7AC 

CAAAAACC77ACAAT7GAAGAGAAC777GGCC7GAG7GCAG7GGC7CACA 

CC7G7AA7GCCAACAC777GGAAGGCCAAGG7GGGAGGA77GC77GAGCC 
CAAAAGTTTGAGACCAGLCTGGGGTAACACAGTAAGACCTCGTCfCTATu 1

AAAAA7AAGAAAAGTTAGCTGGGCA7GG7GGCATG7GCC7G7GGTCCCAA 

C7AC7TGAGAGAC7GAGGCAGGAGGA7CGC7TGAGCC7CGGAGGTCAAGG 

CTGCTG7GAGCCATGT7CATGCGACTGTTCTCCAGTCTGGG7GACAGAAT 

GAGACCCTGTCTCACCAGAAAAACAAGGCAAGAGAGAGAGAGAGAGAGAA 

GGAGAGAAAGAAAAGAAAGAAAGAAAGAAAGAAAGATGGAAGGAAGGAAA 

GAGAAGAAAGAAAGAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGA 

AAGAAAGAAAGAAAGAGAAAGAAAGAGAAAGAAAGGGAGAGAAAGAAGGA 

AGGAAGGAAAGAAAGAAAGCAAGCAAGCAGGAAAGGAAGGAAGGAAGGAA 

GGAAGGAAGGAAGAAAGAAAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAA 

GAAAGAAAGAAAGAAAGAGAAAGAAAGAAAAAGGGAGAGGGAAAGGGAAA 

AGAAAAGGACAAAGAAAAGACCTTTGAACCCTGAATTTCACTTTTAGAGA 

TTCATCTTAAGGAAATTCATTCCAATAGAAATTTATCCCCAGGATTATCT 

AAATATTTGC7TTTATTTTCTTCTAGTAATTTTATGGTTTAACTTTCTCA 

TGTTTAAGCCTTTAATTTATTTGGAATTTATTTTGGTATGAGAAAGTGTG 

ACCTTTTTTTGTTTTACTTTAAAAAAAATGTATTACGATTAXTATTTTAG 

AGACAGGG7CTTGCTCTGTCACCCAGGCTAGAGTGCAGTGGTGTGATCAT 

AGCTCACTGCAGCCTTGAACTCCTGGCCTCAAGCAATTCTCCCTCTTCAA 

CTTAGGAGTAGCTGGGACCACAGGCATGTACCACCATGCCCAACTAATTT 

TTTTTATTTTTTGTAGAGACAGAGTCTTGCTTGTTGCCCAGTCTTGCAAT 

GTTGTCTCAAACTCCTGGGCTCAAGfcatCCTGTCGCCCCAGCCTCCCAA 

AGCAC7GGGATTACACGTGTGAGCCACTGCGCCCAGCTGCCTTTTTATTT 

TTTAATTTT7CAGATGCTTTGTTGG7TCCAAAATAGCAC7TATTAACCCA 

CGC777CCCCC7C7GG7TTTAAA7AC7GCAAG777GGC777GAAA7ACAA 

CCCAC7GCC77A77CAGGC7ACAT7CAAGGAAA7C7GAGACCAAGAG7C7 

GAAGGCCCAG777CCT7CCTCAAACCCAGGAGG7GG7AAA7G7GTCAC77 

CCACAC7T7C7A7CTATTTCTAAGAAC7CC77C777CCAAAC7CTGACA7 

GCCCC7GGC7CAGG7CTATAGAAA77CCCAGGG7CCACAGACAAAGCAGA 

ACTCAC77A7GGGGAAA7C7GGGAAA7AC77A7C7G77AAACC7GCCCCA 

TATGG7GAC7CAGAT7GTCTAAAGCCCAAAGCATCA7T77CCACCCCAAA 

CCAT77CC7CCTCCAGACTTCTCTA777C7G7GG7CCAGAG7CAAGA7C7 

7GATA77ACCC7AGAG7CCCCCT7C7GC7C7CC7GCA7ACCCAGA7GCCC 

CTCCC7CCCCAGA7CCA7TCTCCCACCC7CCC7CCCA7CAG77TGG7GGG 

CCCA7CACCGC77CCCC7GGCCCAGGC7CTCC7777G7GCGC7TGGAGCA 

GCAGAC7GA7C7CCCAGCC7TCAC7CAC77CA7G7GG7AA7C7G77G7G7 

7CA7CAC7G7CAGAA7CTTCTGCA7CCCCTCAC7AC7C7GC7GAAAACAC 

7C7AG7GG77CC7CA7TGC7CA77AA7GAAAG7C7AGA7A77AAACG7AG 

AAGGCCCAGCACAAT7TGCCCC7A7GCCACC7ACC7C7C7AA7C7777C7 

CC7TAC7C7GACAGAC7C7CCG7C7G7CA777A7G7A77C7777A77GC7 

C7CT7C7AC7777AG7A7GAAC7GGA777A7GGA7777777AACA77GC7 

77CAAG7A7GGAA7AAAGAAT7T7A777A777A777A777A7TTA777GA 

GAC7GGG7C7CAC7CTG7TGCCCAGGCCAGAA7GCAA7GG7GCAG7CA7A 

7CTCAC7G7AACC7CGAATTCCTAGGCTCAAGCCA7CCTCC7GCCTCAGC 

C7CC7AAG7AGC7ATGACTACGGG7G7GCATCACCACA7C7GGC7AA7GG 

AATAAAA7A77ACAATGCCTAA7C77AA7TTTCAAAA77T7AAA7TACAT 

7GTACC7AA7GCCCATGCATTTAC77TTTTCAG7GGG7CAA7AGCCCTCA 

C7TTGGCAAAGG7CCCAGGCCCAAGG7AAGGCC77AC77777CCAAAC7C 

ATC7777GAAAGACA7AAGTGCC7G7AAGT7G7ACCACA77AGG77C7AG 

GAA77777CA7CAAAGACT7TA7CAGACTA7777CC7C7AAG77GAGAAA 

GAGC7GGGGGCAGAATA7GGCAC7GAA7GAC7GAAGAGAAGGCACTGAAA 

7CAGGCCAGAGG77GC7GGAAAGAGCAA7GAGGAACACCAGCAGCAA7GA 

GGAGCCGG7GA7GA777TGGC77CACAGGGAGG7G7G7ACCACACCGAT7 

77A7C7C7ACG7GGA7GAACCACAGC7G7CGGC7CCC7TG7C7CCAGGAC 

ATCACAC7C7CCACATTCCCTCCCA7C77CCGGC7TCTGC77CCCGGGGC 

CCTCA7C7GCCCCATCC7GGGTGAACAC7GG7CGG7CAAC7GCTGGGCG7 

ACC77CCCGC7CTGCACACCC7CCC7GGCCACCCCACCCAC7CTCACGGC 

7CGCAC7GCAGAGGAGCCGCA7C7C7AGC7CCAGCCCA7C7GCCTC7TC7 

GAGC7C7AAC77CA7GTAGGCGAC7CC7GCCGG7G77GCC7CACAGGCCC 

A7CA7AC77CAAAGCATTT7CCCC7CAGAACACCA7G7CC7GGC7GC7CC 

C7CCAGAAGA7ACA7CTC7CAAGCACA7CCCCGCGGC7C7CACC7GGA7G 
ACTGCA77CACC77C7C JACA777GCCC7CC777GGA7G7A7A7AGA .

G7777AAAA7ACAAA7C7GA7G7GC77GC7C7CC7GC77GAAACACC7CA 

AAACTGCCTTCAGGATAAACCZACTGCCCTTGACATGTTCACAGGTTGCCC 

ATGGCCTGGCCCTGCCCATCTCTTCAGCCTCATCTCATGCCCCTTGCCCC 

TCGC7CTCTGGGCTTCTGCCTCCCTAGCCCTCCTTTAGGTTCTCTAACAC 

ACCA7AG7CC77C7AG7G77GGGGCC7C7GCAAG7GC7G77CCCA77GCC 

TGAGACATGAATCCCTCTCCCTATCTCTACCTGCACCTTCATCTGATTAA 

T0CCTACCC7TCCTACTCATGATGTTGC7TTCTCAGGGACTCTCTCTGAC 

TTTTTAAACTAATCAGGGTCTCCCCAGTATATATCTTCATAGCACTCTGT 

ATTAC7CCTTTCTTAATGACCACCTGCTGTAGACTGAATGTTTGTCTTCC 

7CCAAAA77CA7A7G77AAAACC7AGCCCCAAA7G7GA7AA7A777GGAG 

GAAGGC7C777GGGAGGCAGAGCCC7CA7GAA7GGGA77AG7AGCC77A7 

AAAAGAGACCCC7GAGGGC7CCC77G7CCCC7CCACCG7G7AAGGA7GCA 

ACAAGAAAG7A7GG7C7A7GA7CCAAAAAGCAGACCCT7GCCAGG7ACCC 

AA7A7GC7GGCAC77GAAC77CCCAGCC7CCAGAAC7G7GAGAAA7AAA7 

77C7A77777CA7AAGCCACCGAG7C7A7GG7A7777G77A7AGGAGCAC 

AAACAGAC7GA7G7GCCACCCAACCA7GA77A7ACG7G7AAT77A7GG77 

7C7C7GC7AG7AGGGA7GCACCA7GGGG77AGGAACCACGC7777C77A7 

77CCCACACAG7CC77AGC7C7AAGCA7G77CC7GAA7CAAAGA7CCCCA 

7C777TA7GAA7GAAGGAG7CAG7GAA7GAA77AA7GAAAGAAC7GA7AA 

CCC7CAA7M7TA77CCAGCC7777A7ACC7AC7A77AACAAGe77GCA7 

7C7AC7CCAAA777A77GGGC777AAC7C7A77777GGCCAGCCAdA777 

GACA77CCC7GAAG7AAA7C7A7GC777CCA7CC7AAG7CAAGGAAGGAC 

C7GGAC7AG7AGGGCCAAGAAAGG7C7AAA77CCA7GGG7GGGAGAGAGA 

GAC7AAA7C7GAAAGGAAGAA7AGA77GAGCAAAGG7G7AGAGA77GGGG 

AAGGC7GGACA777GGAGAGAAGGAAAAGGAAAC7GACAC7AAACCAAAC 

AG7C7CACAAACACAATC7CATCC77CCAAAAC7C7G7GAAG7AAGAA77 

AC7A7CCCAGGGCCAGGCACAG7GGCCCA7GCC7G7AA7CCCAGCAC777 

GGGAGGCCAAGG7GGG7GGA7CACC7GAAG7CAGGAG77CAAGACCAACC 

7GA7CAACA7GG7GAAACCCCATC7C7AC7AAAAA7ACAAAA77AGC7GG 

GCA7GG7GG7GCACACC7G7AA7CCCAGC7AC77GGGAGGC7GAGGCAGG 

AGAA7CA777GAACC7GGGAGG7GGAGG77GCAG7GAGCAGAGA7CG7GC 

CAC7GCAC7CCAGCC7GGG7GACAGGGAGAC7CCG7C7CAAAAAAAAAAA 

AACAAAAAAAAAACCAAAAAAAAAACAAAAAACAAGAA77AC7A7CCCAG 

T777GCAGA7GAGGCAA7GGAAGC7C7AAAAAG77AAG7AGGAGAAACAA 

ACA7GAAA7G7A7G7C77ATGC7777CC7CA7CC7A777CC7CAGCC7GG 

AA7G7CCA77C7CCC7CCAC7A7GCAAA7C7AAC7C77CAAGC7AACACA 

7AGCAA7G7C7GAGAAACCG7CCC7G7G77CAC7C7G77AGCC7CAC77G 

C7CCC7CCCCA7CCC7C7G777CC777C7G77A7AACAC77C7C7A77C7 

GC7GGCA7CACAG7CA7C7CCACC7GCC77CC7CACAAG77AAAAGC77G 

77AAGGGCAAG7GG7G77C777GCCACCTCA77CCCCAGGGC77C7AACA 

CAG7GCC7CA7GCA7GACAGAG77G7AAAACAGG77ACCAAGC7GGC77C 

AGGCAGG777GCA7GGAAC7GTGC777ACAGGAA7ACC7GCTCCCCCCAG 

GCCC7GGG7C77CC7CCTGAGTCCAGGC7CAGAC7C7C7CA7CC7GCTCG 

T7C7C7C77GGGGAGCCACAG7AAC777GAGCAAC777GCA7GGGA7AGA 

A7GGCC7A77AGGGGCAGCACAAAGACCCCA7GGAGGGAAGAG7ACAGAA 

AGGGAAAACGA7AA7CA7A7777777AAGA7G7GCA7777C77AACAAAA 

TGCTCTAGTACTTGTCCAGACTTTCAAACTCAAAAACCTAAGCGTCOOT 

7C77GAAGA7CA7CAAAGGCCCCAG7GG7CC77CAGG7A7G7CAAGC777 

C7AGAAAA7AAAGG7AAG7CA7AA7CAC77AACACACA7GGC7AAA7GGC 

CA777CC77C7AA777A7CAGCAAC7G77ACA7A777C7A7AC7AGAAAA 

AA777A7A777ATAC7CAGGG7GG7AAG77AAA777GCCA7CGAAG7AAA 

GCAGAAAGAGCG7AGCA7G7A7G7A7A7G7AAC7CAAC7G7GCA7GAGAC 

AAAGA7G7C77GAGGAGAA7GAG7C7AAGA7GCGCC7GAGCAA7AG7ACC 

C 
>Contig1

GCACCCATGTTTCTAAAGGGCATACCAGCCATAATAACAGGATGGGTGAG

GATATAGACAGCAGATGACAGAGAGGAGAGTGAAAGCTGGGAATCCCAGC 

TAAAGGCATCAGGTTTAT.GGAATGAGTAGGGGACAATACTGTGTGTGTTT 

ATACACACATGTATATGTGTGTATATGTATACATGTTTATGTATATATAT . 

AATTATATGGTACCATTTCTAATTGACAAAATAATCTATCACATTTTACA 

TTATCAGATTTTACATCTATTGTTCTAAATACACTCAGTCATCAGCCCTG 

TGTGTGGGCTCTTACCCATCCCCATGCACACCTCAGCTCAACCACTGATG 

GATGGATCATCTGCCTATCAGAGGTGGCATATTCAGGTGAATCCATGGCC 

ACAGCTGCAGCACTTCCTACCCACGCAGAAAGGCTCCACAAGAGGAGGCA 

CACCCGCTCTGACTGTCCCTAAGCTCCTGACATCTTCACCCCATGAAACT 

GCTGCTCCTGGGTGCTTCCTGCCTTGCCCTGCCCACCCTTGTACTGTTCT 

CACCATTGACACAGCTGGTGCCCGATGCAC 

>Contig2

NAAAACGAATCGTCACTATTGAAGCCTGTCTCTCANCGGATCGTGACTAA 
GAACCCCCTCCTTGCTTCAAGTTGTCCTGCCTTTCTAGGCAGAGCCACCC 
TACATCTTAAATATATTGATTGATGACTTACGTCTCCCTAAAATATATAA 
AACCAAGCTGTGCTCTTACCAACTTGGGCACATGTGGTCAAGACCTCCTG 
ATGCTCTTGTCATGAGTGGGTGGGTGTTCTCAACCTTGGAAAAATAAACT 
TTCTAAATTAACTGAGACCTGGGTCAGATTTTTGGGGTTCACAGCAACAA 
^TTTAAAAAACTCACCATTGACCTGAAATTTTGACCTTATGCTGTTGCTCA 
CACTCCTCCATGAAAATAGACGCCATCCTATGAGTTCCCTCAGCCATGTC 
ATGCCACACTTCCAACATGTGTCCCCATCCACCATCTGTCTTCTTATTGC 
TGCATCCTACCCAGGCCCTGATCTCTGGACCCATTGTTGTATAATTAAGA 
ATTTGGGGCTGGGCATCGTGGCTGTGGCTCACTCCTGTGATCTCAACATT 
TTGGGAAGGTGTATTAGTCAGGATTCCTCCGAAGGATGCAACCCTAGGGA 
TCCTCTCTATGACCCTATGTCTA 
>Contig3 

CGCGCTCAACCGACCGATTTGCGCGAACCTGCCCATGCCCGAGGACAGTG 

TAATCCTAAAACGTCCCCTGAATCATAAGGATATGAGTGCGAAAGTACGG 

TTCCCTCTGTCACCACTTTCTAACAACGCTATGTCCGATCCGTGCACTAA 

CCCCGCCCAAGTCACTGAAACACTGATGGGCGCTTCCTCTACAGGTATCC 

AGGGCCAATACCACTACTCCCCTCCTCCCTGTCCCCCTTCCACTCTCTAG 

AGGCCGCGGATGCCATCCTCTATTAGCACAACCGAAAACGACGGTGAAAG 

TACCACGAAGCTCACGATCTGATCGGTCGCCCAATGCGGTTACAACGGCT 

GTCATCCCAACCCCCX5TCCCATCCTCCATATTGCCCCCCCCTATGAGGAT 

GGCCCTATCATCATGACCTCCAAAATTCTGTCATCTCCCGACGTAATGCC 

GCCCCTCGAACGCCTGACACCATCAAGTCNGTCACCTCCCAAAATACTCC 

TCCTAATCACCAGGCCGAGTATCCCCGGTTCCACAATACCTCCTTGAGAC 

GGGCCGATATCACACAC

>Contig4 

NGGAGTTTAGGTCAACTAGTAACAAGTGGGATTTGCGACTCAGGTCTATC 

TAATCCTCAAACCCACGTCCTGGACCCCTACACAGACTGCCCTCCCTCAG 

TCCTCTGTGTGGCCTCAAGAAGGGTCTGGACATTCAAGTTTAAAAATCCA 

TCCAAAGAATCTATGGACCCAGTGGTCTCTGGAGTCAATGTTCTGAGGCT 

CAGAAGGGCCAGGCAGGAGGGAGCCGCCTCTACACAGTCCTGAGCAGAGT 

GGGCTGTGTCCCGGCACAGCAGGGGAGATCATAAACAGAATTCTGCCCTG 

GGCCCTATTTAAGTAGGACCTTTAGGCTGCCGGTGTCATGACCACAGGTC 

CCANGTCTGCACGATTGGCTGTGTGTGGAAAATCTTCACTCCTTGCGGCC 

TTGTCCTTGGCAGAGAGCACCGCTGCTTCCTCTGATGGCCACCAGGGGGA 

GGCGCTCCCCTGGGAACGGTTTGAANGGGAGCCTCACCCCACACGTGCCT 

TCCGTGGTACCCAGCACCAGCTGCTACCCATGGTTACCCACAGGCCCAGC 

TCTGCTCTGAAGAAGGAGGAGTGGTGGCGATCANGCCTTGTCTGCATCCC 

GTGGCTGCCCCTTTCTTTTCTTT 

>Contig5 

GGGAGCTAACCGCTCACTGGGATTACAGGTACGCACCACCACGCCTGGCT 
AATTTTGTATTTTTAGTAGAGACGGGGTTTCTCCGTGTTGGTAAGGCTGG 
TCTCGAACTCCCAACCTCAGTTGATCTGCCCGCCTCAGCCTCCCAAAGTG 
CTGGGATAACAGGTGTGAGCTACCATGCCTGGGCTTATATGTTTCTAGTC 
CA^CATTTAGCTACCTTTTTTTTTTTTTTGAGACGAAGTCTCACTCTGT 
TGCCCAAGCTGGAGCACAGTGGCACAATCGTGGCTCGCTGCAGCCTCAAC

CTCCTCAGGCTCAGGTGATTCTCCCACCTCGGCCTCCCTAGTAGCTGGGA 

CTACAGGTACGCACCACTACACCCTGCTAATTTTTTTGTTTTTGTATTTT 

TTGTACAGATGGGGTTTCTTCATGTTACCCANGCTGGTCTTGAACTCCTG 

GGCTCAAGCAATCTGCCTACTTCAGCCTCCCAAAGTGCTAGGATTACAAG 

CATAAGCCACCATACCCGGCCTACCTACTTTTAACTTGTGGAATTTTCTA 

TAAGGTCANGGATGCCTGNGGGAACAAAAGTTTCTCCCTTGGTATATGCA 

AGTAAAATCCACATGCTGCCTCCC 

>Contig6 

AGGACTGTAGCTGTTGTCTAGTCACCAGGCTGGACTGCTTGGCATGATCT 

CAGCTCACTACAACCTCCACCTCCTGGGTTCAAGGGATTCTCCTGCTTCA 

GCCTTCCAAGTAGCTGGGATTACAGGCATGCACTACCATGCCCGGCTAAT 

TTTGTATTCTTAGTAGAGACGGGGTTTCGCCATGTTGGCCAGGCTGCTCT 

CAAACTCCTGCCCTCAAGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTG 

GGATTACAGGCGTGAGCCCCCGGCCCACATGTAAAAGTTTATATCTCTGT 

TGTTTCACCTTGTTTTTGACCTAGTCTTTCAGTGATTTGAATCTTGATTC 

AGTCTTTTGTTATTTTAGTGGTACTTCCCAGCTTTGTGTCATCTGTGGAT 

GACATATGAGTCTTGCTTCTTCATGCCAATTTAAGAAGACTGAACGGGAA 

TAGGTCAAAGGCATGGCCATGAGCGATTTCTCTCCAGCTTTTCATGGTGT 

TCAGCTTCAAATCTATTCACATATTGGAGCTGCAAGCCATCATCTTATCC ~ 

ACAGGCTATCATCATAGGTGAATGTAAATTGGGTTTAGGTGGCCXAGCTG 

AACGTGAGATATNTTC

>Contig7

AGCATGTTCTCTAAAGGCCTATCAAAGCTGACATCAAAGGGATAAGTTCC
AGTTACCCAGCTGAAGGGAAGGAGGGTGTTTCAGATAGAGGAAGGATAAG 
CATGACCTATTCAAGGCCAGTGAAAGAAGCGTGCAACGGCCAAGTCAGGA 
GAACCTGAAATTGTGTCAAAGAGCTTGGATGCAAAGAGCCGTGGGAGACT 
ATTGGGGGTTTTAAGCAGGGATATAATATTCATTCAAGCATGCAGTAAAA 
GGTCACTGGCACCTGCCATGGGCCAGGACTCGGGCTCTACATGATTGCGT 
CTGTTTTGGAAATATCACCCTGGCTGTGAGATGAAGAACAGGTAGGAGGG 

TGGACTGTGGCAATCACAGAGGCAGAGGATATAAATGCACAGAGACACAA 

GGCATGTGGGAGGCAGAAGGAATCAAATACAATGAGTGATCAGATGTGGG 

GTTAGAATGGTGAGTGANAAAGACATACTCAAGGTGACACGCCCAGGTAT 

CTGGGTGGATGGTAAGACATTCATGGACTAGAATCGAAGAGGAGGTGGGG 

ATGGACATTCCTTCCGTTTAGAGGGGTTCACCAGGAGGATTTGCCGGAAC 

ATGGAGAGGATTAACCAGGAATCCGGTGCCTTTTTCCAAACTGGGTTGGA 

GGGG 

>Contig8 

GGTGAATGCTTTGGCACGCTGTGTAGATTTTAGGTGACGGGTGGTGACAA 

TGAGTCCGTGTCGAGCGCTGATTTTTTCGGCCTTTAGAGCGAGATTTATA 

CAATAGAATTTGGCATGAGATTGGATTGCTTTTAGTCAGCCTCTTATAGC 

CTAAAGTCTTTGAGTGACTAGATGACATATCATGTAAGTTGCTGATAGGT 

TTCCAGTTTTCCGCTCCTAGGTCTGCATATTGTACTTTTCCTCTTACTCG 

ACTTAACCAGTACCAACCCAGCTTCTCAACGGATTTATACCATGGCACTT 

TAAAGCCAGCATCACTGAGAATGAGCGGTGTGGTGTTACTCGGTAGAATG 

CTCGCAAGGTCGGCTAAAATTGGTCATGAGCTTTCTTTGAACATTGCTCT 

GAAAACGGGAACGCTTTCTCATAAAGAGTAACAGAACGACCGTGTAGTGC 

GAATGAAGCTCGCCATACCATAAGTCGTTTTTGCTCCCGAATATCAGACC 

AGTCAACAAGTGTCAATGGGCTCGTATTGCCCGAACAGATTAAGCTAGCA 

TGCCAACGGGATAAACGAGTCGCTCTTGGTGGAGGG 

>Contig9 

GGGGTGGGGCGCCTGGTGTTTCTAAAGAGGATCTCCTGCCAGAAATGGTG 
TGCTGACACTGTTGTCCTCCTTGGTGTGGAACTTTGGTGGGAAGAAAGGT 
TGGAAAGGGAAATTTTGATCCTTGGATTTAACCCGAGTTTGTTACTGATG 
CTCACAAGACTAGGGGAAGGATAAAGGCAGGTGAGTCACTCTAGGATGGC 
TCANTGAGCTCCACAGAGCTGGAACCACAGGCACCAGGAGGGATTCAGAG 
CAGGCCTCAGTGCACGTCAGCTGAGTGAACCAATGAGCAGGTGATGGGTC 
CAGGCAGAGCCCTGTCCTCTTTAGGCAAAAACCCTTGAAACACCGTTCCC
ATCCTAGCCTGTGTTCCACCCAAAGCTGGCCAGTCTCCAGGCCCTGCCTG 
AGCCCCAAGGAAGTGGTATGGTGAAACAGAAGGGCCATTCCTGTCCAATG

TGTGAGGAACTTCATTTCAGACTTGTTGGAAGCCCTGATGTTCAAAAACC 

TCAATGATATCATTCATTTTCCCCATCCATTCAATGCCCATCCAATGCCC 

ATCCGTTCAATGCCCCTTCCATTCCTCTTCAGGGAAATGAAAATTGTTCA 

GAAATCCTTTCTCTTTCGAGAAACCAACCAAACCAAAACCGCGAAATTCA 

CTAAACTAGCCAAGACACAATCCTGGGTTATTTTCCTTTTCCCAAACCTC 

CTeTGTTTAAATTAATTCTACCCTGGTTCTCGGCCCTTACTGCGAAGGTG 

AACTCACCTAACCTCTCCCAAACAGAGAAGAAACTTCTCTTGGTAAAATG 

GGTTTTAACACTTCTAAAAAACCCCC 

>Contig10

GCTATGGTTCTAAAGGTAATGGACTATGGCGTACACAACGTCTCGCTCAT 

CGTCTGCCAGGAGGCTAAGGTATCCACGGACAATCGCTGAGCAACAGTGT 

CGTTGATCCATCTCTGTACGCACTTGTCAACATGGCAGGAGTACGGGAGC 

TGCGAGAATCCTCTCTGCTGATGTCCCACGGAGCATGCCGTGAGACAACG 

CCACGAACGGCCCTCGGAGANANCTACTCTGCAATGAAGACGTACGATAC 

ACACGTAGGAGTCCTAGCTCACCAGCCGTATCTAGGTATACTGTACTCGC 

GGATACTCACTCGTGCATGCGGCAATAGATCGATACGCAGTCGTCACGCC 

CATGCTCTCAGTGTGTGACCTTCTGGCGGTAGCGTNGTGGGCGCTATTAC 

TGTGCGCAGCAGGCGCNTCGTACATGTGTCGGGTAGCGATGCCAGGAGCT 

GTAACATAGCAAGTCGCCCCCCTACTCCTATCACTATCCCTACGCTGGAG 

CGCACTCGAGATCTGAACGCACGTCTTAACCTGCCAGTACTCGTGAGACC 

TATACTGCGCAAGCCTTGGCTAGGAGATCCTGCAGCGCCGGCAAAGAATC 

AGCTATGATCCCCTTGCGATTATCGCACACGCACCATAGAGTATGTGCAT 

ATTAACCTCTGAATGTGCTGCAAGCAGACGGTTGCTCAACATATATATGG 

ATGTGGGGAAATCGCCCTGGTCACCGCCACTTGGCGTCAGGAGGCACCAG 

CACGTCTGAGTGTCACGCACGTTACTC 

>Contig11

GGCCGAATGGTGAATTCATCCGTCGTCTCGAGGGGGTGAAAGACGGGGAG 

TTATGCTGTAATGGCACCGCTCACCCTGGGCTTATGAGCAGACCTAACCC 

TCCCANAGTGCTGGGATTACAGGCATGAGCCACCGTGCCCGGCCCAGTAT 

CTGAACTTCTGTGGCCAGGCAGAAAAGGTCCTGTGTTACTCGTCTCCTTT 

ATCATTCATGTCCATATTCTCCCATTTGCTAACATTTATGTTTCTGCTCC 

ACTGGATTCTTTGGATTTTTCTAGAACATACCCATGCTTTGCATTGCCTT 

GGTCTTTGAATATTTGGTCCACTTTTCCTGCAAAGTCCCCTCTCACCTTA 

TCTTCCTGGTAAACTTCCAGCCAACACCTCTTTACTAACCAGAGAAACAT

GGTTCAACTGTGCACAGGCTTGCACAGAAACTGTTCTCATATTGTCTTGT 

CATTGTCAATGTGGCAGAGATGCACCTTAGATACCTCTTTGAGAAAGGAC

TCACTGCCCAGCTGCCTGGCACGTGATGAGCTGATAGCTCCAGCTATAGA 

CTCCTTTAGGGTCAACCTCTGCTTTCCAGTTGAGATCATATCCTTTGCAG 

GGTGGCCTCCCCAGTGATGACTAAGGCAGTGTTACAATGGCCTAGTCATT 

TCCTCCCAATGCTGGACTCCCAATGAACCATCTGCTCCGGAGCTTCCCAC 

TGGGCAGTCAGAGACCTTAGCTAGTCTGCCTCCGAATCAGAAGGCTCTCT 

CTTGCCACTCTGGCC 

>Contig12

GCTGTGTCTAAAGATTCACGGCTGTAGTTCCAACTCCCGCCGCCCTCTAC 
TGTGTCCTCTTAATGGCAGTCATTCACCATCTTCCTGTCCCTCCCCTTCA 
TTTCTTGGATGGTGACTGTCACTTTGCTGCAACAGAACCCTGTCCCAATC 
CTTGATGGTTCAATACACACATAGACATTCTTTTTAACAGGGCGGCCTCT 
CAGGTCTTTAATTTTCTTCCCTCCAATAACCTTGTGATGATCCCCCAGCT 
TAGCCACTTACTGCCAGATCATTACCAGTAACTCCAGCCCCTCCTTAATT 
CTAGTTTCTAATATCCTAATCTGTGACCTCACATTCCAACTTCTTCATTC 
TTATCCCCTGAGTCAAAAAATCCTTTGATCCATGCAATCCATTAAGTCAT 
CTACCTTTTCACCATTCTTCGCCCCACTAGGGTTCTCATTCCTTTATTAC 
CCATATGAAATTCCAAGGCCTGTTGGAATCACTCCCTTGCAGCCACTGTC 
AATACTTCTGCCCCTTTTACTTCATCACCCTTATGTGGCAAAACCACAGC 
CCTGGTGGAGTCGATCCTTACCCCTGCTCTGTGCCAACAGCCGCACACGC 
ATGGCTGATGGAGGTTGGAAAAATCCACACATGCAGTGGGCCCTGTATGT 
CCATATACGTATCCAACCTCCAGCCTTGCATATGCCTCAGTGCTGCCTGA 
CAACACATTATATGTTTTCCTTAGT7CCTTCAGTCTCCTGGGTGCCTAGG 
TGAGTATCTCAGACA7CCTTCTCTCTCTGCAAAGCTCCAACACCTCCACG 
TCACATTCAACTGATGACTGTGTCTCCTATGTCACTTAGATCACAGAGGC

A7ACA7AAACAAA7CCCAGCCAC7GCCAGCAC7C7GCACA7C7GCGAGCA 

TGGCACCCCCAATCTAGGCCTTTCC7GCTGTCACTTGGGGTGAGCTGATT 

ATACTCGATCCTAGTCATTTCTACTTATGCAC 

>Contig13

C77AAGGCC7CCC7C7AACA7777AA777AAGA77GAAAAAGCAAAGA77 

ATXPTGTTTTGGCTGCGCCTATAGTAAAGTAACCCCTATGNCAAATTTTG 

ACACCTTATAGTATTTGACAGGGATAAGTATAAAATTGCTTGATTGATAC 

ATCCACACCCAAATGTATGCTGGGAATGATTTTGTTTCACGGCACTCATT 

ACTTAATTTTTAAAACTCTTATTTAAATTTGCAATGTTTTAAATGACCAT 

CACTTAAAGTAGTAATCAACAGAGGTTAGGAGAACATAACAATACTCTTT 

CTCTTAGAAAATACAACAGAAATATAATTTTTTACAGTTTTGCTCCCAAA 

CTTTTCTCTGTAATAACATGCCTTACTCACCTTTACAATAGGTTTGTTGT 

GAGAATCTTGTAATGTAAACCCTGGGTGTTCTGTGAAGCATTTTTAAACT 

7C7AG777ACAC7GAC7C77A77CAAG7G77777AAAAA7A7A777AAAA 

AACTGGCCAGGTGCAGTGGCTCACACCTGTAATCCCAGCACTITGGGAGG 

CCAAGGCGGGCAGATCACAAGGTCAGGAGTTTGAGACCAGCCTAGCCAAC 

ATAGTAAAACCTCGTCTCTACTAAAAATACAAAAATTAGCTGGGCGTGGT 

GGCGGGCGCCTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGAAGAATCG 

CTTGAACCCGGGAGGCAGAGGTTGTGGTGAACCAAGTTTGCGCCAATGCA . 

CTGCCAGCCTCTGCAGNGACfiBCfc 

>Contig14

GGGGGCGGGCCGAGTGATCCTAAAGCCCGCTCGCTTCACAACAAAGCCTA 

ACAG7CCAA7CAC77AA7GC7GCA777A77CC7GGGGAAGCAAG7C7CC7 

TTGCACTTTACACAGTGAGATAATCAGTTTCTCATGTGGACCACTGGGCC 

AGGAGGGCCTGACAAAGGGCAGTCTACATTTCAGACTGGAAACTGCTCCC 

AGAACTATTTCTTTCTAGTTCCCACCTCGGTCTGAGGTGCCTGAGGAGAG 

GGACTCAACAGAGGAAGCAGGAGCATAGCTCAAAGTGTCAGAACATGGAA 

GAGGAAAAGAATCCTCACAAGATTACGTAACTTACAGGCGTGTTGCTGCT 

TCAGTAGAAGTTTCATCTCCCTCAATCCTGTACACTTTTCCATACATTAC 

ATACTCAAACTGGTCAGCCCTATGGAGCAATAGCAGCAAAGTTATTCTTA 

ACAGTAATTAACAATATAAAAGATCCCATTTAAAAATGGTTACTGGTCAG 

CCGGGCGTGGTNNNTCNANCCTNTAACCCCANCACTTTGGAAAGCATGCG 

GGCGATCCCAAGTCTGATATCGAAACATCTGCCTAACATGTGCAACCCCT 

C7C7ACAAAA7ACAAAAAA7A7CCGGGC77G7G77GGCGCCG77A7C7CA . 

CTACCCGGAGCTAAGTAAGAAATGCTTTACCTGGAAGCGATTTTTTTACT 

TATATCCCCTCTCTTCACCGGGCGCGACCAAATTCTTTAGTATAGGAAAG 

TTTATTGTTTTATGCCTTTGTCAAGGCTCTACTG7ATCTTTTCTGTCCAC 

TCAC 

>Contig15

GGTTCTGAACAACAGCAGGCGATTCCTAGCCCTGTACCCGGGGCATTGTC 

CAACACTCGACAGGGCTGAATTCGTCCATAACGGTGTGCCCCTCTGGGAT 

ATAGGATGAAATGAATTGATCTGAGTACCTGGGATGTAAAGTTACTAAAA 

CGCCAGCTAGGTTCACGCCCCGATGCTTAAATATGATCGTGGCCTACACC 

TCGTCCAGCAGAAAAAGTACCCTTTCTTCAACACCACCTCACGATCCTCC 

AATTTAGGAGCTATAAAACTCATGACTCTTTATTTACCCCCTGCAGATTC 

TCAATCCAATAGTGTGTGTCTCCCTGTGAACTCACGGATATACCGATTTT 

CCCCACGTCATTTCCACACGTCGCAATCGCTTAGTCATCCCTATGTATGA 

GAATCATGGATGACTATGTTGAAGTCCATCTATAAAGTTCAACCCCCATC 

TCCGTCCCTGATTCCCCCTCCCCAAGATCACCAACGCGACTCGACATATT 

GTTATCGCCCAAGGGACCTCTTGCATCCCCCATATCCACTGGTCACCTCC 

CCTCTTGGCTGGAAGTCACCGGGAAGTTCTCCACATGTTGT 

>Contig16

TGCGAGCGATGTTCCTAAACTTTAGCGCCATTGACTCGAGCATGGTCATG

GCTGTTTCCTG 

>Contig17

AGGGTGTTCCTAAAGGATACTACGTTCCCTAAAGTCCAGAGAAAAAAAAA
AAAGTAACATAATGTGGCTTATTTGGTATAAAAATTTTACAGGAAGCATT
GTCAAATATGAAATAGTGTTTGGTTTTGTTTGGGCTGTATTTGTATAAAT
ATGTTATTGGTATGTGTTCCAAAATTATAGGAAACTCCTATAATTCTGAT
ATGACTTGGTGTACATTATCAGTAATAATTATAATTGTTATGGTAAATTA

TTGTGTGCCATGGAGGTAACAAATTTCCTCATCAAGTGTGTCTTTGACTA 

TGGTTGCCCTAAAACTTTTTGCCATTCACAGACAATTGTCTTGCTTTGGT

CCTCTTTAGAAGGTGGTTTTATAATCAGCTATAAAACTCTAACGGGTGCT 

CTTGAATGCAGGCTTAAGATAGCTTTGGAGACTGTGACATCAGAATAGAG 

GAAAAACTTTCAGTATTCATGGAGTGCTGAAATATTCATGAATATCAAGC 

AAAACAGGAATTAACTTCATAGATGGAACTAAAAGAATGCTGAAGTAATC 

TTTTTGACTTTTTTTCTTAAAATGTTGATCCTTCGTTTTGTTTTTCAGAG 

TCAAGGAAATTTTTCTGTTGAGATATTGACAGCTTTTAACAATTAAGTAT 

ACTCCAGTGAACACAATTTGGAGCATATTTGTGTCTCTCTATATATATTT 

GGAAACAATNTTTGAGTATTCTTAACTTATTGCAATATT 

>Contig18

GGTTGTCTGCTATACCAGTAATGGGATTGCTGGGTCAAATGGTATTTCTG 

GTTCCAGATCCTTGAGGAATTGCCACACTGTCTTCCACAATGGTTGAACT 

AACTGACACTCCCACCAACAGTGTAAAAGCATTCCTATTTCTCCACATCC 

TCTCCAGCATCTGTTGTTTCCTGACTTTTTAATAATCGCCATTCTAACTG 

GCATGAGATGGTATCTCATTGTGGTTTCAATTTGCATTTCTCTAATGACC 

AGTGATGATGAGCTTTTTTTCATGTTTGTTGGCCACATAAATGTCTTCTT 

CTGAGATGTGTCTGTTCATATCTTTTGCCCACTTTTTGATGGGTTTTTTT 

XXCTTSCAAATTTGTTTAAATTCCTTGTAGATTCTGGATATTAGCCCTTT 

GTCAGATGGATAGATTGAAAAAATTTTCTCCTATTCTGTAGGTTGCCTGT 

TCACTCTGACAATAGTTTCTTTTGCTGTGCAGAAGCTTTTCAGTTTAATT

AGATCCCATTTGTCAATTGGCTTTTGTTGCAATTGCTTTTGGTGTTCTAA 

TCATGAAGTCTTTGCTCATGCCTATGTCCTGAATGGTATTGCCTAGGTTT 

TCTTCTATGGTTTTTATGGTTTTAGGTCTTATGTTTAAATCCTTCTTTTT 

TTTTTTTTTTTTTTTTTGAGATGGAGTCTTAGTCTGTTGCCCAGGCTGGA 

GAGCGAGTGGCGTGTCTNTAGGACGC 

>Contig19

GCATGTTGTCTAAAGGTTTGTCTTCCTCCAAAATTCATATGTTAAAACCT 

AGCCCCAAATGTGATAATATTTGGAGGAAGGCTCTTTGGGAGGCAGAGCC 

CTCATGAATGGGATTAGTAGCCTTATAAAAGAGACCCCTGAGGGCTCCCT 

TGTCCCCTCCACCGTGTAAGGATGCAACAAGAAAGTATGGTCTATGATCC 

AAAAAGCAGACCCTTGCCAGGTACCCAATATGCTGGCACTTGAACTTCCC 

AGCCTCCAGAACTGTGAGAAATAAATTTCTATTTTTCATAAGCCACCGAG 

TCTATGGTATTTTGTTATAGGAGCACAAACAGACTGATGTGCCACCCAAC 

CATGATTATACGTGTAATTTATGGTTTCTCTGCTAGTAGGGATGCACCAT 

GGGGTTAGGAACCACGCTTTTCTTATTTCCCACACAGTCCTTAGCTCTAA 

GCATGTTCCTGAATCAAAGATCCCCATCTTTTATGAATGAAGGAGTCAGT 

GAATGAATTAATGAAAGAACTGATAACCCTCAATAATTATTCCAGCCTTT 

TATACCTACTATTAA 

>Contig20 

ACGGTTCTCTAAAGACTTTCAAGAGCTGGATTTTATGCTTTAGGTGAAGG 
TGATAAAGTAAAGTGCTTTCACTGTGGAGGGGGGCTAACTGATTGGAAGC 
CCAGCGAAGACCCTTGGGAACAACATGATAAATGGCATCCAGGGTGTAAA 
TATCTGTTAGAACAGAAGACACGAAAATATATAAACAATATTCATTTATC 
CCATTCACTTGAGGAGTGTCTGGTAAGAACTGCTGAAAAAACGCCATCAC 
TAACTAGAAAAATTGATACCATCTTCCATAATCCTATGGTACAAGAAGCT 
ATATGAATGGGGTTCAGTTTCAAAGACATTAAGAAAATAATGGAGGAAAA 
AATTCAGACATCTGGGAGCAACTGTAAATCACTTGAGGTTCTGATTGCAG 
ATCCAGTGAAGGCTCAGAAAGACAGTACACAAGACGAATCAAGTCAGACT 
TCATTGCAGAAAGAGATTAGTACTGAAGAGCAGCTAAGACACCTGCAAGA 
GGAGAAGCTTTGCAAAATCTGTATGGATAGAAATATTGCTGTCGTTTTTA 
TTCCTTGTGGACATCCAGTCACTCGTAAACAATGTGCTGAAGTGGTTGAC 
AAATGTCTCAAGTGGTACGCAGTCATTACTTTCAAGCAAAAAAATTTTAT 
GTCTTAATCTAACGCTATAGTAGGCATATTATGTTCGTATTATCCTGATT 
GAATGTGTGATGTGAACTGACTTTAAGTAATCAGGATTGAATTCCATTAG 
CATTTGGTACCAAGTAGGAAAAAAAAATGTAAAGCCAGTGCTTAGACACA



>Contig21

CGCTGTCTTAAGAACTGGGCTAGGAGTGAGCAGTGAGCCAAGATCGCACC 
ATTGCACTTCAGCCTGGGCAACAAGAGCAAAACTCCATCTCAAAAAAATA

CATATATATATATGACCCATAAAAAGGAGATAAATCAACACTTCAGAACT 

GACCCAAACTTGCAAAGATACTATAATTAACAGAAAAGGACAGTTTACTA

AGTACTCCGTATGTTCAACAAGTGAAAGATTAAACATATTAAGTAGAGAT 

GTAGAAGATATAAGAAGATCCAAAATGAACTTTTAGAGTTGAAAACTACA 

ATATTTAAGATAAAAATACACTAGGTGGGATTAAAAGTAGATTACACATT 

GC^TAAGATAAAAAAAAATGAGCCTGAATACAGCACAGTATAAACTATCT 

TAAACAAAAACACAGAGAGAAAAAATAACTTTAGAGACTTAGCTCTTATC 

CTCTATTTGTTTCTAAACAGAGGATAAGGGGCAGAAAAAATGTTTGAAGA 

AATCATGATTTTTAAATTTCCAACTGAGATAGGAATAGCACTGGGTAGTC 

ACAGGAGGCTGGAAAGACCCAAACAGCAGTTAAAACAGGAACTAGGCAAA 

GAAACCAAAGGATAACAGTAAACCTAAACTAAGGGAGAGAAAACTGACAA 

AAGCTGACTTAGGATAACTGAC 

>Contig22

CCTGAATATAAGCCGCAAGTAACCAATTAAATTTGTTTTCCAAAATTGTA 

TTAACAATCTATGAAATTTTTATCTTGACCATAGCTATAACTTCCAGAAG 

CCTTTTATAACCTCTATAACCTTTATTAAGGAGTAGGTTAATGCTTCAAG 

AAAACCTTGTTAATCTGACACAGGACCCATATGCTGATCTTGCATCAGTG 

TGGCTTGGACATCAATGATTATGATTAATTTATAGAGAAATTGAACTTAT 

XTTA-TCTGTCAAAATTGGCCCTTACAATCTGACACACCCAeCTGTTCCAC - 

TATAGTTCCTGGGCCTTGAGTTGAATAGCTTTAATTTCTGGCTCfSTGTT 

TCAAGAATGCAGTTTATTTTGATTGGCATTTTCTACCAGTCCTGAAGATG 

AACCTTTAATTGCTGTCAGTATTTAAGATTTAGCAGGACTTGTCCTTTTA 

AGAACCAGGAGTCAAGCCCTATAACTCAATGTCACAAGGACTTTAAAAGC 

ACATACATAAAGATATATGGATGTAATAATCATAATTTTTAAAAAATTGT 

ATTAATCTCAGTGTTTTCTAAGCAAACCAAAACTTAATAATAATGGCATA 

GAAATTATTTCAATAAAACATAAAATCTGTTAAGCCAGTTACCAAAAGGC 

AAAAGAAAAGACCTTCTGCAATGCACAGAATATTATGTTGGAAGAAAACA 

TTTCCTTTAGACCTTTAAGAAAACATTGTTAGCATCAGGACACAACAAAC 

AGAATCTGAGGGTAAAAAACGTATATGAGCTGAAGGGAGTTGAAGGAGGG 

CATTACTATTTCCCACCCTTTTAAAGGGGAGAGAAAACCTAAAACAGCAA 

GATGCAATAAAAGCTGAACTTTGGGTTAAAAAAAAATTCTTAAGTCTCTT 

ATAATTTATTAAGAGTGAATCAACCCCGTAAGAAAATTTCATTGTTCTAA 

CCAATTTTTTAATATATAAGTAGTTTTTTAACATCAACCCAATCTCTAGA 

AAGACCATTATAATTTCCCTTTAATTATAGACAACTTTATCATATAAAAG 

TTTTTTTAAATAAATCCTCTTATTGTGACTTACACAGACTATTCATGACA 

TGCTTGGACTTTCTGGTTTGTCGTGAACATCCTTTTCTTTCTTTCTTTCT 

TTTTTAAATTTTACTTTACGTTCTGGGATACATGTGAAGAACATGGAGGT 

TTATTACGTAGGTGTACATGTGCCATGGTGGTTTGCTGCACCCATTAACC 

CGTCATCTATATTAGGTATTTTTCCTAATGTTATCCCTCCCCTTGCCCCC 

CACCTCCTGACAGGCCCTGGTGTGGGACATCCCCTCCCTGTGTCCATGTG 

TTCTCAATGTTCACTCCCACTTATGATTGAGAACTGCAGTGTTTGGTTTT 

CTGTTC 

>Contig23 

GCTAAATATAAGCTATGATAAAACAGTTGGCCCTCTGTATCATGGGTTTC 

ACAACTGTGGATTCAACTAACTGTGGATGAAAAATACTTGGGAAAAAAAG 

AATGGCTGCATCTGTACTGCACAAGTGCGTGCTTTTATTCTCGTCATTAT 

TCCCTAAGCAATACAATATAACAACTATTTATATAGCATTTACGCTGTAT 

TAGGTATTATAAGTAATCTAGAGATGATTTGAAGTATACAGGAGGATGTG 

CTTAGGTTACATGCAAATATTATGCCACTTTATATAAGGCCCTTGAGCCT 

CCTCAGATTTTGGTATCCATGGCAGTCCTGGAGTCAATTCTCCTGCAACA 

TCTCCATTTGTTCAGATTCTCTTCTATATCATGTTTATATCAGAAAATCT 

ACATAAGATTTTTTAATGTGTTCATATAGGTTTTGTGTATTTTTGGTTGT 

TAATCCCTAGATATATGCAGTATTTATTGCTATTATGAGTAGTGTTTCTT 

TACCATGTATTCTAGTTGGTTATTGCTGACAGAGAAATGTTGCTGGTGTT 

TCTAAGTTACCTTGTTTCTAACAACCTTGCTGAACTCTTATTAGTTCTCA 

TAGTTTTTAATTAATCTTTCTTAGTTCTGATAACATAATCTGCAAATAAT 

GACAATTTTATATCTTTCTTTCCAATGCTTATATCTCTCAGTCCTCTTTA 

TCCCAAAGTATTTTCCAGGATCTCCACTATAACATTAAATAGTAATAAGA 

ATTTCTGTCTTGTTACTGATCTTAAGGAGAATAAATTTAAATTTCCTCTG 
TCAGGTTTTATGCTTGATATAGATTTGTGATATATAGCCTTTCACA^^r

AAAAAAAAATGCTTTCCTAGTAGTCCTAATTTTTTAAAAAAATCATCATA 

AATAGATGTTGAACATTATCAAATGCTTTTTCTGCATCTATAGAGATAAT 

CATATGGTTTTTTACTATTTATTAATGTAATGAATTAGACCAATTTTCTA 

ATGCCAACTCTTTCTTGTATTTGTAGGGTAAATCCTATGGGATCATAAAA 

TACTTTTAATACATTGTTAGATTTGAAGAGTTAACGCCTTATTTAGAACG 

TTTTCAGTCACATCCATAAGTGAAATGGCACTATAGTGTCTATTACTATT 

ATATTTTTCTGGTTCTGAAACCAAAATTATACTCACCTCATACAGTAAGT 

TGGGCAACTTTTGTTCTTTTTTTCTGAAACAATTTGTGTATAGAAGAAAT

TAACTGTTCCTTGAAAGTTTGATAATAATCATCCAGAAAATTATCCCCAT 

CTAGGGCTTTTACAAAAAGGAGACTCTAGAATGCCATTTCGGTTTCCTTG 

ATGTGTATTGGCCTCTTTCATTTAGGCTTTTGGATTTTTTAGGGCATTTT 

TTCACTATAGGCTTTTTACCGG 

>Contig24 

CATAAACTTCAGGTTGGATGTTCGGTCAAAGTGGTCCGGCGATGCGAAAA 

CGAGAGGGCTCGAGGACTGGGCAGAGAACTATTTGAAGGTAXCTCTCAGG 

GGAAACCAAGCGGAAGGCGGGGAGTAAAATTGGGAGGGAGCGACGGCCTT 

CAAAGAAGGGGCTTGCATTAGATCGGCGAGATCCGGGAGGGTCTGGTGGG 

GAGAAATGACTAGAGGACAAATCTAATGGAGAGACAGACGGAGATAGATA 

TCGTGACAGAGAGAGGGACAGTGACACCGCAqAACAGTGCAGGGTCCATG 

AGTACAAGGCCCTTAAGTGTACACCCCAGCCGGAGTCATGGCAATTCGAT 

TCCTGTACTGACCACCCAGGATTTGGGTAGACTGTACGAGTTAATGAGCA 

TGGTCCCCAACAAGACTGCTTCGACCTCAGATGCAAAGCACACTTCAGGG 

GTCCCCAAGCCACTCATGTTTTTTGAATGACTGCCATAAGTTCAAAAATT 

CCCACAATTCTCTCAGATTCAATAACTGGGTATAACCACTCATAGAACTC 

AAGAAAATGCTATCATTATTATTACAATTTTATTATAAAGGATACAAATC 

AGAAGGACTAGCCAAATGAGGAGACACATAGAGAGAGGACTAGTAAAAAA 

CAGAGCTTCTGCGTCCTACCTTCAAGGAATCAGGATGCACCACCCTCCCA 

GCACATCAAGTGCTCATCAACCAGGAAGTTCCTCTGAGCTCCAATGTCCA 

GAGATTTTAGGGAGGATTCATTACATAGGTATCATTGATTAAATCATTGG 

CCATGTACTTGAACTCAATCTCCAGTGTCCCTCTTCTCCCTAGAGGTCTG 

AAGGGTTGGCTAATATCATGTGGCTCAAAGCCCCAACTCTAATTACCTTT 

TTGGTCTTTTCAGGGACTAGACCCCATCCTGAAGCTATCTACAGGCCCTG 

CCATGAGTTAGCTCATTAACATAACAAAGACACTTATATTACTCAGAAAA 

TTCCAACAGTTTTAGAAGCTCCATGTCAGGAACCTGGGACATAGATCAAA 

TTC TTTTTT T TTTTTTTTTTTT GGAGACAGGGTCTTGCTGTGTTGCCCAG 

GCTAGAGTGCAACGACAGATCACAGCTCAATGCAGCTTCAACTTCCCAGG 

CTTAAGTGACCTTTCCACCTTAACCTTCCAAGTATCTGGGACCACAGAAA 

ATGGCTAATTATCCTGGCTGATTTTTAAACTTTTTTTTTTTGTAGGGATG 

GGATCGCCCTGTGTTGCCAAGGTTGGTCTCAAACTCCTGGGTTCAAGCAA

TCATTCTGCCCTGGCCTCTGTGATGGTTAATACTGAGTGTCAACTTGATT 

GGATTGAAGGATACAAAATAATATTTTTGGGTGTGTCTGTGAAGGTTTCG 

CCAAAAGACATTACTTTGAGTCAGTGGACGGGGAAATCCCCCCTTCCCCA 

TGGGACGGGGAGACCCCCCTCCATCCAGGTAAAAAAATCTAATCACCTGC 

AATGTGGCAGAAATAAAGGAGGGAAAAAACGGGGACCCCTANATGGGTTA 

TTCTCCACCTAATTCTTCCCCCAGG 

>Contig25 

CCATGTATTTCATTTCTACAGACCCTGAGATGAATTTGTCATTGCCACGG 
GGTCCTGAAGTTCAAATACTCTATTTGGTATCCTGCCCCTGTGGTTAACT 
GTGATCATTTCACTCACCTTGTTTATGATGAGAGGTGCCACCATCTGGCC 
TCCTCCACTCTGCAATCCTGTTAATTCCTATCAAAGCTGAAAACCTGCTG 
CAGCACCCACACCATCACCTCCAGCCTAGAGAGGGAAGCTACCAGTGAGC 
TCTCCTGGATGCCGGTGTGCCCCTCGCCAATACATTTCTTCTTAGTCCCT 
TGGTCATCCTGAGGTGTGTGATTAATGGACAGCTATGTGGATTGCACATA 
ATAGATGTACTCCAGCATCTTCATCCCTGATTTTCCTTTACAGAAATCAC 
TCAACCTTAGCAACATGTGAAAATCACCTAAGGACATTCTTTAAATCCCT 
CTGTCCACATGGCAACACAAACCACTTAAATAAGAATCTCCAGGGAGTCA 
CTCAAGCATCAATGTTTTTTAAAGCTCCAATTTTAAGGATCATTACATTA 
TGTCGAAGAAATTATAGTATTTCAGCCTTACTGACTGTAAACCACCACCA 
TATCTAAGCATCCATTAGTCAACCTAGCAGACAATAAACTAACATTACCT 
CCAGGTACTCAAATCZAATTCATTGCATCCCAAATCCCAGATGGGdCCACC"'"

CTTATTGACAAATTCAGCCCAATCTTGGTTGAACACATTTAGAATATATT 

TCCATGAACAATATCCGGTTGACGAGTTTCTTTAACTTTTTGGAGTTTAA' 

GCCATTTCCTTTCACAGTAGCCTTGTTAATTCCCTGTCAATGCTCCATGG 

GGGTCATGAAGAGACCTCTTATTAACTGTGAAGCAACTTGGCTCAGGTGC

AGACACTCAAATGCTTCACATGCAGTGGGAAAAGAGAGTGATTGTCTAC 

>Contig26 

TTTAAAAAGAACTGAGTCTTTATTCAGTCGATTCTTCTAATCTATGAACA 

TAGCATCTCTCTCAAAGCATTTAGTCCTTCTTTAATTTCTGTCATTAATT 

TTTTAAAATTTTCATCCTAAAGATTCTGTATATGTTTTGTTGAATTTATG 

CTTAAGCATTTCACTTTCTTGGTAACAATTATAAATGATTTTGTGTTTTT 

TATTCCACTAGTTCATTTTCAGTGTGTAGAAAAGCAATGAATTTTTGTGT 

GTTGATCTTTGTTCCAACATCTTGCAACATTATTGAACTCATTTATTAGT

TCTAGGAGGTTTTTTCATTTTTCTTGTAGATACCTTGAGATTTTCTATAT 

AGACAGTCATGTTGTCTGCAAACAGGCACAGTTTTATTTCTTCCTTTTCA 

ATCTATATGCCTTTTTTTTTTTTTTTGCCTTATTGCAGTGGGTAGAACTT 

CTAGCACTATGTCAAATAGCATTGGTGAAAGCAGACATCCTTGTTCCTTG 

TCTTAGAGGAACATTTGGTCTTTAATCTTGATTTAAAAAATTCCTTGCAC 

TAAGTTACCGTGTTTTGCGGGAGGGAGAGGTGGGGTGAGGTGGGGATTTC 

CCCTAATGTTTACAAGCTGGGATTTTCTTTTTCCTGTGTCTAATTATTTT 

CCTCATTGGCTTGAAAAATCTGATAAAACATTTTAGGACTGTGTATAAAA 

TAGAATTAGCCAAGTGCAATGTCTTTATTCAGAAGAAATTTCATGGACGT 

TGTGCCTACTCTCTTGGCTTCCTGGCTTCATGGCTTTCCAGATCCCACAG 

TAAGCTCTGGATAGTAGAAGTTATAGTAAGACTGACTTCTAAATAAATGA 

AGTGACTTTAACCTTACTGATATGGCTTAAAGAAAAGGAGTGGCCTTTAA 

GATCCATGAACTTCTCAAACAAAAGTGATAACGTTATCTCCATGCATATA 

TAATACTAAATATAATGCAACTGAGAGAAGTAGGCTGTGGTAAGAAAGGA 

GACCCAAGTGCCATCTGAAGGCAGCACTTACCACTCTGCTTCATCCCACC 

GAGGAAACAAAGCATGAGTATTGCCAGATTTTCTTCTGTTTCAAGAAAAG 

CCAGAAATCCAGGTTTTTGCGTGAAATGTCCTGATTTTAATGTTGGGAAC 

TAATTTATATTTTGAAATAACATTGTGTGGGACAAGTGAACTTGTATGTG 

GAACTGCTTTCTCCCAGTGGCGACCAGTTTGGACCGTTGATACTCAGCAA 

GTTCAGCCAAGTGCGCCTTGTCATTGTCAGTCATCAAGGTGATGTGTGAT 

TGGTCAAGCAATTAATTTTGCTCAGCATCTCGTGTGTTTTCAAAAGAACT 

GAAGGTTCATTTGC 

>Contig27 

TTTCAGAGCACAATGCGTATTCATAGTATATTGACTTAATTTCTAAGTGT 

AAGTGAATTAATCATCTGAATTTTTTATTTTCAGATAGGCTTAACAAATA

GAACATTCTGTATATAAATGTGTAAATTAGAGTTAATCTTTCCAATCACA 

TAATTCGTTTTATGTGAAAAAGGAATGAACTGTTCCATGCTGGTGGAAAG 

ATAGAGATTATTTTTAGAGGTTTGTCGTTGTGTTTTGGGATTCTGTTTTC 

TTTTAAAATTGTAAATATGTACTTGTGTGAATGATTTTTTAAAATGATTT 

TACCATTTTTGGAAGGGTATTTAATGATAGAATATCATCGAGCCAACATG 

CACTGACATAGAAAGATGTCAAAGATATATTAAGTGTAAAATGCAAGAGG 

GAAAACACTATGTACAGTCTGAGCCAAATCAAAGCATGTATGTTTTTTAT 

ATGTGTACAACAAAAGGTTTGGAAAGATATGCGCCGAATTGTTAAATGTG 

GTTTCACTTGAGGGGGTGGGAGGATGGGGCCCCAGAGGGGTTTTTATGGG 

GGCCTTTCACTTGGTATTTTTTTCATTTTGTTCTGTTTGAAATTTTGTTT 

TTTCTTTTTAAATGGAGTTTCACTCTTGTCGCCTAGGCTGCAATGTAGTG 

GCGTGAACTCAGCTCACTGCAACCTCCGCCTCCCAGGTTCAAGTGATTCT 

CCTGCCTCAGCCTCCCATGCCTCCTGTGTAGCTGGGATTACAGGCACCCA 

TCACCATGCCTGGCTAATTTTTGTATTTTCAGTAGAGATGGGGTTTCACC 

ATGTTGGCCAGGCTGGTCTGTAATTCCTGACCTCAAGTGATCCACCCACC 

TTGGCCTCCCAAAGTGCTGGGATTTCAGGTGTGAGCCACCACGCCCAGCC 

CTGTTTAAATTTTTTATAAGTATGTACTACTTTTGTAATCAGAATTATTA 

GAAAGCATTTTACTGATTTAAAAGCTTAGACATGTTCAAATGCCTGCAAA 

ACTACTTAACACTCAGCTTTAGTTTTTCTAATCCAAAAAGGCCGGGCAGT 

TAATCTTTTTGGTGCCAATGTGAAATTTAAACGGTTTTATGTTTTTCCTG 

TGTTGTGAATGAAAAATATTTCTGAGTGGTGGTTTTTTGACAGGTAGACC 

ATGTCTTGTCTTGTTTCAAAATAAGTATTTCTGATTTTGTAAAATGAAAT 
ATACAATATGTCACAGATCTTCCAATTAAGTAGT

TCCTTGCTAATTTAAGCTTGCATAAGTCACTTTACTAAAAGATCTTTGTT 

AAGCTAGTATTTTAAACATCTGTCAGCTTATGTAGGTAAAAGTAGAAGCA 

TGTTTGTACACTGTTGTAGTTATAGTGACAGCTTTCCATGTTGAGGTTCT 

CATATCACCTTGTATCTTGAAGTTTCATGTGAGTTTTTACCATTAGGATG 

ATTAAGATGTATATAGGACAAAATATTAAGTCTTTCCTTTACCTAAGTTT 

GCTTTCTTGACTAGTAATAGTAGTAGATATTTCTGTAATAAATG7TCTCT 

CAAGATCCTTAAAATCTCTTGGAAATTATAAAATTATTGGAAAGACAAGA 

ACAGT7TTTATTCATTATATGCATTATTATCG 

>Contig28 

CTTTCTCAAGAAAAGGGAACTGGAGCAATTAAACATATGTAATTTTTTTT 

TAAAAAACCCTAAACCTAAACATCTACCTATATACAAAAATTAATTAACA 

ATGGATCATGGACTCCAATGTAAAACATGAAACTCTAAACTTCTAGAAAA 

AAAACTGGAGAAAACCTTTGGTACCTATGACAAGGCACAGTTTTTAGACT 

TAACACTAGAAGTGTGAACTATACAAGAAAAAATTAATAATTTGAACCTT 

ATGAAAATCAAATTATTTGCTCTCCAAAAGACCCTGTTAAGAGGATGAAA 

ACTAAATTACAGATTGAGAGAAAATATTTGTAAATCACATATTTGACAAT 

GGACTTGTATCTAAAATATCTAAAGAACTCTCAAAACTCAACATTAAAAA 

AAATATCTAATTAGAAAATGAGTGAACATTTTACGAAAGGGGCCTTATAG 

ATTAGCAAATAAAACACTTGAAAAGATAGTCAGCATCACTAGCCATTAGA 

AAAATGCATATTAAAACCACAATAATGTATCGCTACACACATATAAGAAT 

GGTTTATGAAAAAATAGTGATGACACCAACTGTTAGTGAAGATGTGGAGA 

AACACTCATACATTGCTGGTAGAAATGTAAAATGGCATAGCCACTGTGGA 

AAATTATTTGGCAGTTCCTTTTAAAACTAAAAATCAATCTACCACACAAC 

CCAGCAATTTCATTACAGGGCATATATCCCAGAGAAATGAAGATTTATGA 

TCACACAAAAATCTGTACACAAATGTTTTATGGTCACTTTATTCATAATA 

GCCAAAACCTGGAAACTATCCAAATGTCCTTCAATGGGCAAAGGATTAAA 

CACACTGTGATACATCCATACCATGGAATACTACTCAGCAATAATAAGGA 

AAGAATTACTGCTACACACAAGTTGGATTAAACTCAAGGAAATTGTGCTG 

AGTGAAAAATTAACAAGCCAATCTCAAAGGACACATAGTTCATGATTCCA 

TTTGTATAACATTAATTAACACAATTAATTACAGAGATGGAGAACAGAAT 

AGTGGTTGCCAGGGATTATACATGGTGGACGCGGTGAGGCGGGCCTCCAC 

GCCTTGGAGATGAAGGGGGCTACACCCTTTAAAGCACACCCACGAGAGAG 

TTTTGTGCGGAGGGGCCCAATTTAAGTACTCCGCCCCGGGGGGGGAACAC 

AGGGGCAAACAAAAAAAATTGGCCTTGGGGGTGACCAAACACACAAAAAA 

AAAACAAACACACAAAAAAACAACNATGGGTGGGAGGATTAATCGCCAAA 

TCTGAGTAAGCTATCTGGACAGTACCAATATCGATTTCCCAGTTTTGATG 

TTGTACTATAATAATGCAAGATGTTAACATTGGAAGAAGCTGGCTGAAGG

GGGCTCAGGAACTCTCTGGACATTTCTTTGTACCTTCCTGTGAATCCATC 

ATTATTACAAAATAGGACATTTTCTAAAGGTTAAATCATTTTAATTTTAA 

AATGTCCCTGTTACTGTTGAAACTCACATCTCCATATACTGATCAAGAAC 

AGCACTAATGGCCCCTGGCCTCCAGGAATTCACAATTCCTACTGACTTTT 

CTTTGAAACCTTGGCCAAGTCGCTTCTCTTCTCTGGTCCTCAATTTTTCA 

TCTTCAAAATGAAGATTGAATGACTATTAAAATCTCTTGCAATTCTTGAG 

ATGAAGGGTCCTAAAGGAACTGAAGAGGATGCCATGTAATGTAAATATGG 

GTTTTTACTCCATCAGCCAGCCAAGACAGAGGGCAGACACCAAGACATGG 

TAACCAAGGAGGCCATGTGTAAACAAAGACCATTTAGACTTATGCTCTGG 

CCTTTGCAGCCCAACTGGTGTGGCCAGTTGGTGGGGTATGAAGAAAATGG 

GGCCTTCCAGGAACCATGTTGAGTGGAGATAAGCAGGGAGGAATGCAGAA 

GACATGGGGGCAGTGCCAGTCTCAGCCCGAGCCAGCTACACCCACACATG 

GTTATGAAAGACTGACAGCCTGTAAGNTGAACACAGCCCTGCCTCTCTTA 

GATAGGC 

>Contig29 

GCAAATATGATCTCAGATGTGGATTTACTGTAAAGTTCATCAAATTTAAA 
TTTCAGAACACTTAATCTGCAAGAGTCCTTTCCAAGACCCTATACCTAAT
TTTGTGTTTACAATTTTATATTTGTTTTCTTAAAGAAGACCACCAATATA
AACTATATCCAGCCTTCATGATAAGTACATAGGAAACTATGCAAATAAGG
GGGAAAAAAAACAAAGAAAAATACCTAGTTTACTAATGGTTCACTTCTGA
ATAGCACATATTCATAATGATACAAGCACTCATTACTAGTCTAGGAAAAT
GAAGATATAATTGCATTAGGAAGATCAAGAGGTAGGAAATGTGGATGTGT
GTGGTATAGACTAGGGCAGGACAAAGAACCTAAATCCTCATTTTCTAAAG

ATAATTGTTAATACGTAAAACTCAAAATTCAAGAAGTAACAGTAAAAGCG 

GTCATTAAGAAACAAGCACTAAACACCAGATAGGAAGCGAGAGATGGGGG 

AAGAGGGCGACAATCTGATTATTTTTTGCAACAAATTTTGTAAAACCATT 

TGACTGTTTACATGTAGAACTTGGATCTTTTTTAAAAAACACAAAATAAT 

AATACTATTATTTTTTAACTGGATTTTTGAAAAAGAAGATAAAAGTCTCA 

TTrTAGTAATTAAAACTCATTCCAGGTTAGTCCACTCAAAACTTATATTC 

GAAAATTAAAACTTTGGGAGGCTGAGGCAGGCAGATCACCTGAGGTTGGG 

AGTTCGAGACCAGCCTGACCAACACGGAGAAACCCCGTCTCTACTAAAAA 

TACAAAATTAGCTGGGCGTTGTGCATGCCTGTAATCCCAGCTACTCGGGA 

GGCTGAGGCAGGAGAATTGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAG 

CCGAGATCACACCATTGCACTCCAGCCTGGGCAACAAGAGTGAAACTCCA 

TCTCAAAAAAAAAAAAAAAAAAAAATTAAAACCTCTGGAAGTTGAGTTTG 

CAAATATTCATTATGCTCATTTTTAACTTGTATGTTTGGAAAATGTCATG 

ATGAAAATTGAGGTTGGGGGATGAGAAAAAAAGAAAAACATCAACCCCAC 

AGCCCATTCAATTTTCAGCCCGACCCACAGCTCCGGGGAAGGGCAGCAGG 

TCCATCCTTCACTCTTTCTTCACCTCTTTCCCCTCCTTCTGGCTCTTCCA 

CCTCTAATTTGGAGCCCAAAAAAAGGCACTGGGAAATGGAAAAGTCTTTT 

GTACGTGGTACTTGCCGGGGAAGCTGCCATGAAAACCTGGCCCCACGGTG 

GGGAGGGAATGCCCANCTGAGGCCTCGTGCCCATGCTAGGATAGAGTGGT

CCAAACATGTCAC^TGGTCTGAC^GGGCAAGCANCANGAAATCATCSTATG 

AGTATGAACTGATCTGTATGCAAGGGCGGGGAGAACACGCGGAGGAATGG 

GGCGTGAGAAAACAGCACAGTACGTTTCTTTAGCAGCTGTCTCTGCTCAG 

CCATGGGAGGTCACAGAGAAAGAGGCTTGGAGGCGTTATTTTCACTGTGA 

GATGTGAGTGTAAAAAAGTGCCCAAGACACAGTGAGTACCAGGGAGATGC

CCTCTTTCCTACCCGAATGCAGAATGGCCACAGGCCTTAAAACACACACA 

TGGGTCCTCAGAGGAGAGAGGCCTCCACAGTGGACACCCGCATTCTCCCC 

TGGTCAGCAGCAGCAGGGCGAGTGCTGGGCCATCATGAAGCTTCACAGGC 

AATGAGCTCTCAGCAATAACAGGAACAGTGCCTGGGGGACTGTAGCTGCA 

AGACCGATTTTCATGTAAGATGGCCTCTGAGGACTCCGAGATACACCAGG 

CTGAGACTAGCTGGCAGCTCCAAGTTGTTGGTCAGAAGAGAACAGGAACT 

AGGGAAATTGGAATTACTGTTACTACAATTCCTTTACATCCGCACAACCA 

TGAGGTCCAGCGATTTTCTATTATTTTTTTTTTTAAGACAGGGTCTCAGT 

ATGTCGCCCAGCATAGAGTGCATTGATGTGATCATGGTTCAGTACAGTAT 

TCACGTCCCAGGCTCAAGTGACCCTCCTGCCTCAGCCTCTCAAGTGGCTG 

GGACAGCAGTTGCATGCTACCAGGCCAGGCTTTTTTTTTTTTTTTTTTTA 

GTTTCTGTAGAGCACATAGC 

>Contig30

GGTTAACAATGGCACAGGGAAACAAACAGTTCCAGGTGCAGGGGCTCTAA
ATCTATCATAAGATGTTAGGTATGGGGGCTCTGCCGGACACAAACTCAAG 
GCTTTATGCTGTTATCTCTTGAGCGAAATCCTGGGAACTTCGTACATTGC 
TTGCTTCAGTACCTTATCAGTTAATCGGACTCTTTGATATGTTGGGAGTC 
AGCGTACACAAGTTAACTCCTTGAGGAAGGGGGTGGGTAAGGAGTCCTTG 
ATGTCTGGTAAATGAAGGAGCGAAATCGAGTTCCTCTGGCTTTCTCAGCT 
AAGGGAGAGCTTATTCATGTGGAAACAAGGCTAAGTGATTAAGGGAGAAA 
GGGAGAGTCTGAAAACAAGGTTAGGTATTACAATGTCAATAAAATTGGTC 
TCCTTATACAGTCCTATGGTAGATTTCTTTCCATCTTTAATCTCCCTCTA 
GCACCACCAGACTTTTTCTCTCTGTACCTTGAGATGTAAATTTTGCTATC 
TGAATTTTCGTCTAAGAGTTGTTTCCTTTAATATGCAAATTTAGGGTTAT 
TTAGCTGACAACTGCCAAAGTAGTGAAACAAGTTATCAAGAACTTGAACG 
TCTAAGGTAGGAAAAAAAAAAGTCTTTATGAATCTATAAGATGTACTTCT 
ATTGGCATGCCTAATACGTCTATGTATTTACGTGTTGTGTACACAGTTTT 
TCACTACTGAAAATATATAGAGGAGTTCTAATTAATTGACTTAAGACAAT 
AAAAGCGCTTGAATCAAATACCTTATCAGGAAAAAGGAAAAGACAAGTCA 
AATGCTTGTTCAAGTTTATATAACTTAAGTAAAATCTTTAATAAATAAGC 
TAGCTTTAACATTATTTGAAATGTCTTAAGAATTGCCAGCAGGTTCTGGG 
TTACAGAACTAGTGGGGGTGCAGTGGGGTGAGGGTTGGTGGGGTGGGGGG

TGGCAGGACCTTGGCAGCACCCCGAGCCGGCATGGCGTTAATAATGGAGG
GATGCCAGACCCAAGTGGCTAAGGCCCGGCTGCAGAGCCAAGTTGGCATT
TCCAGACTGGGGCTCGGGCCGCACCCTCTCCAGGACCCTCCCCTTGTACC
GAGCAGATTGTCGCGGGCAGTTTGGGCCAGCTGTCCTGGCGTGGAATTTC 
CCAAATTCAACAAATCCTCCAAGAAATCAATCCATCCATTCATCCATCCA 
TCCATCCATCCATCCATGCATCCATCCATCCGTGGCAGATTATGAAGCAT 
GGATCATTACTTTTGGGATGTGGATATATTCAGTTAACAAGGAGCAGCTT 
TCAAGAGCTGGATTTTATGCTTTGGGTGAAGTTTAGAAACACTAGCTCCC
AG 

>Contig31 

ACCTCATGTGCTCTAGCGCCTCTTACCTCATGCCCTCCACTCTCAGTCTT 
GCACTCACCCTGCCACACTCAAGGGCTTCCCCAGGTTCCTTCTTAGATTC 
CACCGATAGCTCAGGGACTTTGCACATGCTACGGTCTCTGCCTGGCTCCT 
CCCCAGATCTTCTCATGCCTAGCTGCTTCTCATCAGCACCCCTCAGAGAC 
TGTCCCTGCCCCACCTCTCCAGGTTCCATACCTGCCACCCTCCCCCAATC 
ACGTAACAGTTTCTTCACAGAGCGAGTTACCATCCCAGTATTTCCCTAAC 
TTATTTTTTGTGACTGGTCTGTTGCCTGTCTCCACCACAAGAACATAAGC 
TGCATGTGAACAGGAGCCTTGTCTATCTTGTCACCCCAGTGGCTGTGACA 
TAACCTGATACACATTAGATGCTCAATGATGTTTGATGAATGAAGTGCTG 
GTAGTCCAACTGTGTTTCCTTGTCTGTGTAAGTATGTCTGTTGTGGTTTC 
CTAAGAACCTACAGCTCTCCCACTGTGACTCCTGTTCTATGGTCCTGATT 
TGCTGGACTAGAATCCTAACCTACATGCTTACTCTTAGTGTCCTCCCGCA
GAGGCTGAATCCCAGTCCCTAAACCTCCACCAAATGGCTAAGACCTAGCT
TCCAACCAGACAGGCCTACGCTGAGACCTCAGCACCGCCCTTCTGCGGTC 
TCATCCTTAACGCATCCTTCAGGGCCCAGCTTAAATGTCTCTTCTCCAAG 
GAAGGCTATCCTCTTTCTGCCCCTCAGTGCTCTCCATGCCTCCTCTATGC
CTCCATGCCTGCTTTCCAACCCTGCAGAGGTGGAGAAGTTGCTAATCTGC 
TGTGTTGACATGTGCTGGGGTGCCTTGGGCCAGGGAGCAGGCTGGTGGTG 
TGCTGATAGCCCGTGGCTGTGCCCAGGTCCATGCTCACTTCCTGAGCCCC 
AGTGGAGTAGGCTCCCTTTCCCTTATTGCAGCACTCAGAGGAAGGACGTG
CTTCTTAGGACAGATCTGGCCAACCTCTCCCTCGTGAGAGAAGGCCCAGC 
CATCCTCTTGCCCTCTTTCTTTCTCCTGCCCCCGAGTAATAAAGGTGCCT 
GGTCAGAGCCTTCTAGAAGGAGACCCAAACATCCACCACACATTCCCAGT 
TCCAACCGTCATCCACATGGCTGGCTGTGCAGGTAAACGCAGAGTCTGTT 
TCACACACCCAACCATCTAGTATTGGATGGGAGGACAGTAGCGTGACACT 
CTTCTCCAGCCTTGAGCCCTACTGTGGGCCCCACCCAACCCAGATACCAG 
AGGAGCCCTGTACTGGGATGCTATTGGATGCTTGTCCAGTCATGTACAAA
GTTAGCCCTTTGTTATATAGAGTTAGCTACGTACATCTTCCTCTGTAGGG 
AACCCAAGAGGGGAGAAGAGATATGTAGTAGGATTTAACCTGCAAATCCT
CTGCTGAGCACCGTGCACTACATACAGTGGGTAGCATGTGGTAGGTGCTC
AATAACTATTGACCGATCTATTGAATACACGTAAGATCGTGACACTATCT
AAAACGNGGGGTGTGGGGGAAAAACCCCCCCCTTGTTTAGGAAACCCAAA
TTGGACCGTGTTGGC
>Contig32 

GCGCGATTGTGCTAAAGATCATGCATGCCTGATCAAACGTCCCCATATGG
CGTCTCAGAGTCAACTCCTTCCCCATCAGTGCCCTGACTTCGGCATAACA
AACCTGGCAGGTTAAGTGATTAATCGGTCCTGTACAACTGTAGCCCTTAG
CAGGAAGCACTAAGCTTCGTTTTCATTTATTTCTTCCCTGGAACTGCAAG
AAATGAGGGATGCCTTCCGCCATGAAGTTTTGCTGATTGTCCACTTTGTT
CTCAAGGAGATATTCACAGTTTTTAATTTGTCTTTCTCTCCTGCATGGTC
TCCAAACCTGTCCAAAGAAGCCAGCTGGCTCCATCATCTGTAAAATCACC
ATTGTCACCAGAGCACTTGACTTCCTGTTGCCCTACAATCCACCTGCACT
TTATTTCCTGCCACCATGATAATGTAGTGTTACTACATTTTACATTCAGC
TGTAAGAAATGTTACATTCATTTACTTAAATCAAATTAAGTCTGCTCACT
CAGTCCCCCACAGTGACCAACTTATAAAAGAGAAGGTACATTTCAGTCAT
CACTGAGGTTCTCTCTACCACTGGAAAACTGAGGAAGGGTCTGGAGTCCA
CAGTGGTTAACATCATTGCCTCTGTTTTTTCTCCTACTCAATGTAACCAT
CCAAGGTTACTCACAAATTCACAAAAAGAGGTCTTCACCTCTGCTCTCAA
GACCCAGAGGGCTGGGTTCTAAACTCAAAGGCCAATGTTCCCCAACTTTT
TGCATTGTTTCAACATTGGGGAAAACTCGAGGGGATTCAAGAATGGTTAT
ATAAGTTTTGTGGAAAAATGTATAATTTTTTAAAATTAAAATACAAAGTA
TTATGGAAAGCACTAAATATTGAATTTATATAAATATTCCAAATATTTTT
CTAAATTTTTAGTGAGAAACTTGAGCTTGCTTCTGTGAGATATTTATTTT
AAAACAGATTTGACACTTAAAATGTCTAATCAAGCCTTTTAAACCATGAT 
CTATCTCTTCAAATTCTTCAGATGCCACCATCAATAAAGAAACTTTGTTC
ACACAAGTAAGTGGTAGCAAATGGCAGGGTGTTTATCATTTTTTTTTTTT 
CTTTTTTTGAGACGGAGTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGG 
CGCGATCTCAGCTCACTGCAAGTTCCACCTGCTGGGTTCACGCCCTTCTC 
CTSCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCACCTGCCACCACGCC 
CGGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCGTGTTAGCC 
AGGATGGTCTCGATCTCCTGACCTCGTGATCCGCCCGCCTCGGCCTCCCA 
AAGTGCTGGGATGACAGGCGTGAGCCACCGCGCCCCCGCCGTGTTTATCA 
TTTTTTGCCTGATGAAATTTTCCTTGCCACTACTCTGGATGGTTTGATAC 
ATTTAAATTGTGCTTCCAGGGTACAATTATCCTTTAAATCTATACCTCTT 
TCCTTTCTTTTATTGACAAATATAATGTTACACTTTTCTGTCATTGCAGC 
CACACCACCAGTACACAGATCCCAACAGAGTTGTAATATTTTATTAGTTT 
CAGAGTTTCAATATTTTATCACTTTCAATACTTCATGTGCAGGAGTTTTA 
TTTGGTACTTCTTTACAAAATAAATGATGTGCTTCCAAGCATTTCTTTTC 
AATAATTCCAATCAATGTTATTAACTGAGTAATACTAGTATCTGTTTATT 
CATAAATTCACAGGAAATGCTTTTTTACTTATTAGTCTTTGGAATTCTGT 
TGTTTGTATAAACATCTTTCATGATGGCTTTGTGTCTACCAATAGCACTA 
TTGCCAAAAGGCACCTTTTTCTTGTTCCTTTACTTCACTGGTCCG^GCC
TGGTACCAACAACTACCACACAGACTGGGAAATGAGCAATTTTGCCACGT 
3CCCTTAGCTATTAATGGTGGCACTCCATAACTAGCATCTTAAGCTCAAT 
TTCATGAAAGAAATGTGTTTCTTATTTTGTACTTGCAGGCACTTTTTAAA 
CTTGTAATCTTTTATTCATACTTTAAAATTAAAACAGAGTAATAGAACCC 
ATAGAAGGAAATCAATACCCACGAGTCCATACTGATATAAATAAATAGTT 
ACATAAATAAATGGGGGGAGAAATAACAGCTCTTCCTTACAGAAAAATTT 
CAATTAATAAATGAAGAAGGAATTAGGGAAATACAACGTTACCATTAAGC 
AACCACAGTAATAATCATTACAGGCAATATCCAAAAATAAATTCCAAAGC 
CAGTGGGCAAAAGTTTGAGGAGATACAGGATATTAACATAGTCTCCAAAT 
AGCTCATGCTATTTATAAATTACAAAAGGAAACATAACAACTGTATAGTG 
AAGAAACTCAGCAGACACCACCTTAGCCAAGTGATCAAGGTTAACGTCAC 
TAGTAATAGGGCTTGTTGACATACTGGACTCCAATCTGATACACTGATAA 
GGACACATGACTTCTGCAGTATTCTTACCAAAAACAGAATTCTAATGTAA 
TTAAGGAAAATGTCAGACAAACCTATTCTGAGAAACATTCTATAAAACAA 
CTAACCAATACTTTCAAAATTGTCAAGGTCATAAAGACCAGGCGATGGTC
ACAGATTTGAGGAGACTAAGGAGATACAACAACTAAATACACAAATGGAA 
CCATGGCATTCTTGATTGGATCTTGAAACAGAAAAAGGATATTAGGAAGA
AAAGCTGATGAAATTCTAATACATTCTGTAGTTTAATTAATAGTATTGTA 
CCAATATTAATTTCCTAGATTTGATCATTATACTATGGTTAAGTTTTTAA 
CATTAGAGGAATCTGGGAGAATGGTATATATGAACTCCACTGTTCATTCA 
ACTTTTTCAGTAACTATTATTTCAAAATAAAGTT 
>Contig33 

GGGAGCGGCGGCCCACGCTGATCTCTAAAGCTTTAGACCACATTGGCTCG 

AGCATGGTCATGGCCGTTTCCTG 

>Contig34 

GACGTCTTAGCGCTATATTATAAAGAAATATTCACCTCCCTGCTGAGCTT 
ACAGGGTGTACCTAATGTCCAACAATATGAAATCTCTTCAATGAATTGCA 
GCACGTCCATATATAACCCACATGGAAGCTGTCCTCTTTCCTCACCTTCG 
AACTTCCCATGCCAAAGAGGGACCTCTTGGACTCAAATACATCTTAGCAA 
TATAGAAGATGCTGGAGACTTGTAGGAGAAGTGGAGAGGGTTTACAGTGT 
AGCCCCACAGAAAACAACTTATGACCCCATCAGTCACTTGTCCCTTTTTT 
CCATGCCTCAGTCTAGTCAGGAAACCACTAGATCCTGGATGGCTTCTTCT 
CCCTTCCCCTCCTTTCTCTTCTCCTCTCCCTCCCTTGCTCCTCCTTCCTC 
CATCACCCACTCCTTACTTCCAACCAAAACTTGACTAGCTCCAGTCTCAT 
CCCTCCTTATTGAAAACTATTTTACTCAGCCCTCCTCCCCCACTCCTGCC 
CAATCTTTATTCCTTACCTACATCAGACTTCACCAAAACAAAGGCCAGGA 
TAATAAACAGGACAAACTCTTTCAAACACATTTTAATGACCATATTTTGT 
TATTTTGGTACAATTTGAGGAGTCCCAATCCCCAGGGAAGACTAACAAGA 
AGTTCTCCTAACAAAGGTGGGTCTCCCCTTACTAAAAACTCCTGTAATGG 
CTGAAAAGAGCATGAGGTTTTCTGCATATCATTACACATTCAATAGAACG 
TCATGCAGCTGTTAAAA^xGATCTGTAG. ^GGCTATCTTGTGACA^AA^
GCATTGGAGATATACTGTTAGTGACAAAAATAGGTTATAAATGAATTTTT 
CCATGCATGCCTCTATATTTATAAATACACACACATAAAAGACAGGAAGG
ACAGACATTAAACATTCATAGTGCTTAAGATGATGCATAGTATAATAGTT 
AGGACCATGGCCTTTGGGACAGAAAACTACAGCCTCTCTCCCACTTATCA 
GCCATGGGACCTTGGGCAATTTGCTCAGCCTCAAAGCCCCTGTTCCTTTA 
TC^GTGTGCTGGGGTTGTTGTAAGAGTTAAGTGCAATACACAGAGAGAGA 
GAGAGTACCTAACATGTATTATGTGCTCAGTCAATATGCATCATAGTACT 
CATTGTTACATATGTTCCTAAGTGCTTTATACGTTTTTTCCCTAAGTTGA 
CCATCTGTTTTTGGCATTATGAAACATAATGATCCTAACAAATTAAAATT 
AAAAACATAAAGAATATTTGCCCCAAAAAAATAAAGAACATGAATTCTTC 
AAGTAGCCAAGGGGCCATAGACAGAAGTAAGCCCTTGGTGGGGCTTAGTT 
GAGAGAAGTCTCCAGAAGGTCTTTCGTGTGTTAAAGAAGAGGGTAACAGG 
GAGGAGGTGGGGAGAGATGTTAACTGAGTCTAAATGAGCACCTGGAAGAA 
GAGATGGGACAGGCCACTTCTGCCTGGACTCCCTGATTGTTAAGAAGAAT 
GAAAAAGAGCAGAAGTCTTCCCTGAGCCCAACTTCACTCCCTGACTTAAC 
CTAGTCTTTGCCCCTTCCCTCTCACTCATGGCTACTTTCTGTGGTCACCT 
TGTTGTAGAAATGGATGTGCAGCCACCTCATCTTTTTCTACCTCCTTCAC 
ATGTTTTAGATAATTTAATGTAGTAGAAGACGGTTACAGCAAAAAATTAC 

AAAAATCAAAATATCTCTGCTATCTACTGTTGCATTTCTAACCATCCCAA

AACAGTAGCTGAAAACAGCACTCGTGGTCGAGCGCGGTGACTCATGCCTT 

TAATTCCAGATACTCCGGAGGCTGAGGCAAGAGAATCACTTGAACCCGGA

AGGTGGAGGTTGCAGTGACTCAAGATCATGCCACTGCACTCCAGCCTGGG

TGACACAGTGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAGCACTCGTG 

TATTTTGTTCAAGATCTGTGGTTTGGGCAGGGCAGGGCTCAATGAGGACA 

TCTCGTCTCCGTTCCCGCAGTGTCAGGAAGTGTAACTGAGACTGGAGGGT 

CACACAGAAGATGGCTCCCTCAAGTGGCCAGCAAATTGGTGCTTACAATT 

GACAGGGAGCTGTTGACCAAGGGCCCCAATTCCTCTTCCTATGGCCCCTT 

CTCGGGCTGCATGGGCTTCTTTACAGAATGGCAGCTGGATTCCAAGAGCA 

AGTATCACAACCTACAGAAGAGTGGAGGAATATTGAAAGTTCACAGTCTC 

TTAAGACGTTGGCCCAGAAACTGGCAAAAGCTTCATTTCTGCCATGTTCT 

ATTGATCAGTCACAGAACCTGCACCAATTCAAGAGGAGAACATATAGAGG 

ACATCTCTCAATGGGATAAGTGTCAACAAATTTGCATCTATCACAATCTG 

TCTTTTGGGTACAAACTATTTCTATTCCTCCATTATGCAAAATATACTCA 

CAACCTCCCAGGGGTCGCAAAAGCCTCATCCATTTATGGCAAATGTGGCC 

CTTTTAATTTATATAAAATAATTTGCGGGGGCTTCCTTTATATTTTTAAC 

TCCCCTGC 

>Contig35

GTGCAGAGAAGTGATTTAAAGCCCTTCAGAAAGAATGCTTTATTCCCGTG 

GAATTTGGTAACTTGCTTGGGTGTGGGGAGGTTTGTCAGCTTTCTCCACT 

CAAATTATCAGACCCTTTCCATTTAGTGGTAGACCATTTCCCTCGTCCAG 

GCCAAGGGCACATAGTACAGAGAAATAGGGAGTTGTTACCCAGGGAGAGA 

ACTTGGCTCTAAACCTGTAATAGAAAGGTCAGTTCTGGTCTGGAGGGTCA 

ATTTTGATCTTTGGCTCAGATCCAGGAATTGGAACCAAGGCTTTTGAACA 

TTTTAATGCAGGGGATTAAAAAAATGATACGAGTCATTCACGAATATATT 

TGCTTAACATCTAAAGAGATCCCTCAAAACACTAGAAAAAATAAGAACAA 

AAATCTAATAAAACAAAATTTGTTAAACACATTTACCAAATTTTTTTTTT 

TGGTAAAAATTCAAATGTCATAAATAAAGCTAAAGTTCCTCTTGATGACT 

CGCTCCTCTGCCCTATTCCACTCCAAGTAACCACTATTATCAGTCTTGCC 

AATACCCTTCCAGACCTCTCTACCTCTATATACCATTAGAAGCACATGGT 

TTTGCATTGAGGATGTGCAGTGTTTTGTTTTACGTAAATGTTATCACTCT 

GTTCTTGTTCCATAATTTGCCTTTTTCTCTCAATGATTTGCTTGGCTATC 

TTTCTATTTCAGTAGCATCTCCTTTCTTTTTAACTTACCATTGTTTATTT 

AACCTTGCCTCTATCAACAGATATGTAGGTTGTTTCTAGTTGATTTCATT 

AAGTATTTATAAACAACGCATCAGTAGATGTCCATAAATTTCTTTACGGA 

AGATGGCAAGTAGTGGAATTGCTGAGCCAAAGAACATGTTTAAAAAACCC 

AAAAAAACTAGACGCTACCAATTTTCTCTCCAAAATGGCCATACCCACTT 

ACCCATACAGAGATGATTTGGAATCTGGCTTCCTCACAAGGTGAGATGCC 

TTCACAGTTTCATTCTTCCTGGCATGTCTTCCCTTTTGTATCTGAGAGAG 

CTGGCAGAATTGTGTCACTAAATCAAGGATAGAGGGTCAAATGACAGCTC 



FIG. 4 (13 of 61) 







WO 99/06426 



PCT/US98/16102 



AAGCTCACAGGCACCTCrGCTTTCTTCUwAGACCACCTGCTTTC^ 

CCAGCTCTGTTCCATCTTATAGAATGGTTGCCACTTGGGTGTCTGCTCCG 

ACAGCCATGTCATCCTTTGCACTGCAGTTATGAAGCAGACAGAGCTAGGA 

GAGGGGCTTTGCCAGCCTCTGCCCTAGCTTGGAGAACTTCAAAAAAGGAG 

GGTATTGAAGTTGAACTCCCCCAAAAAGGGGTGGTCCCCACACCTCAAAA 

AGTGGTGCCTCCGAAAGAAATGTAAAATTCGTGTGGGGGGGGAAAAAGGT 

TAJTTAGAAATTGTTGGCTTGTCGTGCCGAAAGTATGTGTGGTTACGGGG 

AGTACGGAAATTTCGAGGGGTGGGGGCGAGGCCGTGTGTCCTTTAGCCCG 

GGGTTTTCCCGTCGCATGTTTAAGGGGGGGGAAGAGGGGGGATGTTTTCT

TTCCGCGAAGGTTTTTGAAGAACGGCGTGG 

>Contig36

CCCCCCACCGCCACTACTCAACCGGCCGTTCACGAAACAACTCGCCACAT 

CCACTAACCCGCTGGCTCACCACCCACCGCCCTCCCGATCCCCCCAATCC 

AAACTCAACCCCCACCACCAAGCGCCTCCCCCCTCCCCCACCCTCCAGCT 

CAGCCCCAACCTACCACCAACCCCGACTCGCCCACCGAAAACCAACAGCA 

AACCCAAATGCCCACAAAACCAGTGTCCAAACCCTCCTTCCCATCAGTTT 

GGTGGGCCCATCACCGCTTCCCCTGGCCCAGGCTCTCCTTTTGTGCGCTT 

GGAGCAGCAGACTGATCTCCCAGCCTTCACTCACTTCATGTGGTAATCTG 

TTGTGTTCATCACTGTCAGAATCTTCTGCATCCCCTCACTACTCTGCTGA 

AAACACTCTAGTGGTTCCTCATTGCTCATTAATGAAAGTCTAGATATTAA

ACGTAGAAGGCCCAGCACAATTTGCCCCTATGCCACCTACCTCTCTAATC 

TTTTCTCCTTACTCTGACAGACTCTCCGTCTGTCATTTATGTATTCTTTT 

ATTGCTCTCTTCTACTTTTAGTATGAACTGGATTTATGGATTTTTTTAAC 

ATTGCTTTCAAGTATGGAATAAAGAATTTTATTTATTTATTTATTTATTT 

ATTTGAGACTGGGTCTCACTCTGTTGCCCAGGCCAGAATGCAATGGTGCA 

GTCATATCTCACTGTAACCTCGAATTCCTAGGCTCAAGCCATCCTCCTGC 

CTCAGCCTCCTAAGTAGCTATGACTACGGGTGTGCATCACCACATCTGGC 

TAATGGAATAAAATATTACAATGCCTAATCTTAATTTTCAAAATTTTAAA 

TTACATTGTACCTAATGCCCATGCATTTACTTTTTTCAGTGGGTCAATAG 

CCCTCACTTTGGCAAAGGTCCCAGGCCCAAGGTAAGGCCTTACTTTTTCC 

AAACTCATCTTTTGAAAGACATAAGTGCCTGTAAGTTGTACCACATTAGG 

TTCTAGGAATTTTTCATCAAAGACTTTATCAGACTATTTTCCTCTAAGTT 

GAGAAAGAGCTGGGGGCAGAATATGGCACTGAATGACTGAAGAGAAGGCA 

CTGAAATCAGGCCAGAGGTTGCTGGAAAGAGCAATGAGGAACACCAGCAG 

CAATGAGGAGCCGGTGATGATTTTGGCTTCACAGGGAGGTGTGTACCACA 

CCGATTTTATCTCTACGTGGATGAACCACAGCTGTCGGCTCCCTTGTCTC 

CAGGACATCACACTCTCCACATTCCCTCCCATCTTCCGGCTTCTGCTTCC 

CGGGGCCCTCATCTGCCCCATCCTGGGTGAACACTGGTCGGTCAACTGCT 

GGGCGTACCTTCCCGCTCTGCACACCCTCCCTGGCCACCCCACCCACTCT 

CACGGCTCGCACTGCAGAGGAGCCGCATCTCTAGCTCCAGCCCATCTGCC 

TCTTCTGAGCTCTAACTTCATGTAGGCGACTCCTGCCGGTGTTGCCTCAC 

AGGCCCATCATACTTCAAAGCATTTTCCCCTCAGAACACCATGTCCTGGC 

TGCTCCCTCCAGAAGATACATCTCTCAAGCACATCCCCGCGGCTCTCACC 

TGGATGACTGCATTCACCTTCTCCCACATTTGCCCCCTCCTTTGGATGTA 

TATAGATTGTTTTAAAATACAAATCTGATGTGCTTGCTCTCCTGCTTGAA 

ACACCTCAAAACTGCCTTCAGGATAAACCACTGCCCTTGACATGTTCACA 

GGTTGCCCATGGCCTGGCCCTGCCCATCTCTTCAGCCTCATCTCATGCCC 

CTTGCCCCTCGCTCTCTGGGCTTCTGCCTCCCTAGCCCTCCTTTAGGTTC 

TCTAACACACCATAGTCCTTCTAGTGTTGGGGCCTCTGCAAGTGCTGTTC 

CCATTGCCTGAGACATGAATCCCTCTCCCTATCTCTACCTGCACCTTCAT 

CTGATTAATCCCTACCCTTCCTACTCATGATGTTGCTTTCTCAGGGACTC 

TCTCTGACTTTTTAAACTAATCAGGGTCTCCCCAGTATATATCTTCATAG 

CACTCTGTATTACTCCTTTCTTAATGACCACCTGCTGTAGACAGAATGTT 

TGTCTTCCTCCAAAATCATATGTAAAACCTTCCACCAGAGCGATGATTAG 

AGAAGCCTCCC 

>Contig37 

GACTGACATTCAGAAGATATTAATAAGAGCACTAATGATGGGGATTGCAA 
CCATGTCTTTACTGACTTCCAGAAGCTTCTTACAGTAAACATGAAATCAC 
ATAATTTCTTCCACTTTCCTACTGTTTCTTGTTCTGGGCTCTGTCCTGCT 
TACTGTCTAATATCTTGGCCCCTTAAAAGTTGCTAATCTTCCAAACCTCA
ttcctgtgactgggccbwtggtccttgx - catgggc cttgaagatac1

ctgtacacttatctggagcatccagtgcctaccacctgacccagattcct 

cattgcgctcctccctcctccacctaatgggatttgctcatacccgtgtg 

ggacccctcccattttccccaactgaatacttatcaagacaacgcattgc 

catactccctcgtaccctgctctgggcatcagactgaatgtttgtttcca 

ttgaggatctgcagctgcatcagtttccccagcaccgtccaaccccttga

gcatggctagtcctaaagcagagaattagcctttctatccctgctgctat 

actatgctgggacaaataataagaaatgacagcattttatgataatgcagg 

ctgcaggaggcaggaggcaggaatcaaattcgtgcttatcaaatagtgct

ccaattctttgaatattggactatagaatatgtcatggatctatgctcag 

gtgggttccctattactcactccactgaggccaggttgtgggattagctg 

tccaagagggagtttcagtctcacagcatagggtcattctgagaattact 

ggcccacacttgtgtggagacctccagagaacagaatctgggttggtgcc 

atgtacttccaggaggagagaagtggcaggatgcccagccccacaatcag 

aggggaaggggcagagccacatgtatgaagatcctctccccagtacgtgc 

caatcacagggcttcctagcttttgggccaaggaaacaatgtgggaagca 

aaaaaggacaattttctcctccctttgcatgaagactgagcagttttacc 

agattcccagggaaacacccttccactctgggttgaatgtgagtgagaga 

cattcagctggaacactagaaaaactatttcctgagccactcacctttag 

ccctagaaagtgttggatttgtccttcatctttgccacagtagagactgc 

tgatagcatcagaacttgggctctggjSttagacagatatgggta^ 

ctgagctctctcacttattagtgtgggatgtagagcaacttttaaaatcc 

ttccaaacctcagacttctcatgcatgatgtgaggattgtaatagggccc 

acctaataggggtttttgagaattaaaaaagttattcaatgaacagcatt 

tagcaagatgcctgaccattgagaaaataacaaattgtttattattattg 

ttattattaaacatctttcctgcaccttctgactgggggcatcgtatcat 

cagaaatacttaggatgggatggattcctgcatgggctgagtcaagggtg

caataatggaggagtgaagaaggaagaaatggaggcagaaatccccagga 

gcccagcatggtacaaggctgagctagtgctgcagagcctccttggaaca 

gccacagagcttgcatctggccctggaggaacctcttctagctggcagga 

ccagccacaacagtggccaggggatttcccagggcgtgggctcctcagga 

gttcatttggaccaagcctgcctggagaggggttataacagggatccttc 

cctactggcaggtgatttacccctcggtgagaagctcaggcatttgtttg 

atggaaggtggaaggccctgtgctgggccagtgactatcagggatgggcg 

ggtggctggaaaatagcaaataagacaatatgataacacagttaaccacc 

acactatgtgaagctacaatatgggtatctgtaatagacaattccaatgt 

agagaataattttaaggtgtcattctccccgccaatgccataagcacacg 

gcctctgcctgggtttctcactgtggaatgtcctcctggtctcctcatgc 

ccagagagtgggaagtactcctactttaacaccggctttcctgtcatttc 

cntgcagccctcctcagccccctctgcacagggaggtttcctccctgctg 

ctgcagtgctttgtacttgttagtggtacctgcacacaggtattggtgtc 

cttgtctcaccaccctacatcactgtaagctccccaggagcaggcttcct 

gtttgactcacctgtgatcctccacctcccaccctgtagtgcctcaagca 

ttctgtagagcacatggacgcc 

>Contig38 

GACTAATAAGTACTTCATTATTTGGGTATTTTCCAAGAACAACATATTGT 
AGGAAACCATTCTTTCTAAAAAAAAAAGTGTCCTTTTAAAAAGGTGAATA 
ATTTTTGTCTAATTCAAAGTTTATTGAAAAGTTATGTATAAAACAAGGTA 
AAAGGAACAAGGAAATAAGGGAAATGTAAAGAAAATTATAGAAATAAAGT 
GGTATTTTTTGGTAAGAAAGCTTAAAGAGAAATAATTTTAGGTAAGAAAG 
AATCTTACCTAAAATTTTGTGCTAGAATAAAGTGACTGGCTAAGAAAGGG 
ATGTTCAAAGCTATTTATGACAAACCCACAGCCAATATCATACTGAATGG 
GCAAAAGCTGGAAACATTCCCTTTGAGAACTGGCACAAGACAAGGATGTC 
CTCTCTCACCACTCCTATTCAACATAGTATCGGAAGTTCTGGCCAGGGCA 
ATCAAGCAAGAGAAAGAAATAAAGGGTATTCAAATAGGAAGAGAGGAAGT 
CAAATTTTCTCCGTTTGCAGATGCATGATTGCATATTTAGAAAACCCCAT 
CATTTCAGCCCCAAAACTCCTTAAGCTGATAAGCAACTTCAGCAAAGTCT 
CAGGATACAAAATCAATGTGCAAAAATCACAGGCATTCCTATACACCAAT 
AATAGACTAACAGAGAGCCAAATCATGAGTGAACTCCCATTCACAATTGC 
TACAAAGAGAATAAAATACCTGGGAATACAACTTACAATGGACATGAAAG 
ACCTTTTCAGGGTGAAn — GCAAACCAC* ^CTCAAGGAAATAAGAGAGu^A

ACAAACAAATGGAAAAACATTCCATGCTTATGGATAGGAAGAATCAATAT 

CGTGAAAATGGCCATACTGCCCAAGTAATTTATAGATTCAATGCTATCCC 

CATCAAGCTACCATTGACTTTCTTCACZAGAATTAGAAAAAACTAATAGCC 

AAGACAATCCTAAGCAAAAAGAACAAAGCTGGAGGCATTGrGCTACCTGA 

CTTCAAACTATACTACAAGGCTGCAGTAACCAAAACAGCATGGTACTGGT 

ACCAAAACAGATATATAGACCAAAAGAACAGAACAGAGGCCTCAGATATA 

ACACCACACATCTACAACCATCTGATCTTTGACAAACCTAACAAAAATAA 

GCAATGGGGAAAATAATTCCCTATTTAATAAATGATGTTGGGAAAACTGG 

TTAGCCATATGCTGAAAACTGAAACTGGACCCCTTCCTTACAACTTATAC 

AAAAATCAACTCAAGATGGATTAAAGATTTAAACATGGCTGGGCATGGTG 

GCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGATGGGTGGATCAT 

GAGGTCAGGAGATGGAGACCATCCTGACTAACACAGTGAAACCCTGTCTC 

TACTAAAAAATACAAAAAATTAGCTGGGCATGGTGGTGGGCGCCTGTAAT 

CCCAGCTACTTGGGAAGCTAAGGCAGGAGAATGGTGTGAACCCAGGAAGT 

GGAGGTTGCAGTGAGCCAAGATCACGCCACTGCACTCTAGCCTGGGCAAC 

AGAGTGAGACTCCATCTCAATAAATAAATAAATATGGAACTCTCCCAACA 

CAATAATAAGACAAACCCCCAAATGTTTTAAATGGGCAAAAATATTTGAA 

CAGACACTTCACAAAAGAGGATATGTAAATGGTCAAAAAGCACATGAAAA 

GATGTTCAACACCATTGGTCATCAGGGCAAAGAAAACTAGAACCACAATG 

AGATGCCTCTGTACACCACTTAAATGTCCAAATTAAAGAAAACAAGTTTT 

GGCAAAGTTGTGGAGCAACTGAAATGCTCGTGTATTGCTGGTAGAAAAAC 

AAAATGGCATAACCATCGCAGATAATTTGTTGTCAGTTTCTTACAAAGTT 

AAACATATACTTATTGATATGACAGTTCCATTCCAAGAGAAATGAAAACA 

TAAGTCCACACAAAGACTTGTACCTGGGTGTTCATGGTAGCTCTATTCAT 

AATTGCCAAAATCTGGAAACAAATCAAATGTCCATCAGCAATGGAATGGA 

TATACAAATTGTGGTACACATGTACAATAGAAAACTACTCTGCAATGGAG 

AGAAATTAACCATTGACAAACACAAAAACATGGACAAACCTCAAAAACAT 

TATGCTGAGCAAAAGAAGCCAGACACAAAAGACTGCTCAGCGCATGATTC 

CATTCATATGAAATCACAGAAAGGGTCAGTTGAAGGTGCAGAGACAAAAA 

GTAGATCTGCAGTTGCCTGGGGATGGGGTGGGAGGTTGACTGCTCTGACG 

CGTAAGGAAATTTGGGGGTAGGTGGGGGATGGTGGGAATATTTTTTGAAT 

TGAATTGGGTAATAGTTTTAATAGGTAAAATATTGGACCCCACAGTATTT 

GAGATAGGTTTCAGTCAATTTAGACAGTTTATTTTGCCAAGGTTAAGGAT 

GCATCCGTGACCCAGCCTCAGGAGGTCCTGACAACCTGTGCTGAAGGCAG 

TCAACATACAGCTTGCTTTTATTCATCTTAGGGAGACATAATACATCAAT 

CAATGCATGTAAGGTTTACATTGGTTCAATCTGGAAAGGTGAGGGAACTT 

GAAGCAGGGAGCTTCCAGGTTACAAGGTAGATTATTCTCAACAGAAAGGA 

ATGTCTGGGTTATGATAAGCGGTTGTGGAGACCAAGGTTTTATCTTGTAG 

ATGAAGCCTCCGGGTAGCAAGCTTCAGAGGGAATAGATTGTCAAAGTTTC 

CTATCAGACATAAGGTCTGTGTTGATGTTAATGCTGGTCAGCTTTTCCTG 

AATTCCAAAAGGGAGAAGGGTATACTGGGGCATGTCCAACCTTCCCTTCC 

ATCATGACCTGAACTAGTTTTTTCAGGTTAACTTTGGAATGCTCTTGGCC 

AAGAAGAGGGGTCCATTCAGATGGTTGGGGGGGCTTAGAATTTTATTTTT 

GGTTTACAGTGAAGACTTTTCAAGCTAGACACTTAAATGAGTATGTTGCA 

AAATGGCAATTTCTTAGCACGGC 

>Contig39 

GACGTCCTAAAGAAATGCTAAGGTAACTCAATTAACTATGCTAGAAAAGA 
GAGTTAAGTATTTAGGAGGATTTAATATGGTGTTAAAGTTGTGAAAATCA 
AAATGGAGACACTAATGTTAAGAAAACCCTGATAAATGGAGCCAGGGAAG 
GCCATGAAGAAAGAGTTCTCACACTTGTATCCCTGATCATGAAAAAGACT 
CTGCAAAAAACAAAACCTTGCACAAAGGCCATTGCAACCTTACACAAAAA 
ATACTACTTTAAAAGGACATGTGCCCAGCAACTGCCTGTCCAACCTCAGA 
CTGGCAATATCTTTGTTATTGATCTTAGTAGCCCAGCATAACTATTTCAA 

GTAAAAACCTTTGTCTTCTTTACTTACCCTGAATATGCACAGAGTTTACT 
ATGGAGTGCATATTCCTGTTGCAATGCTCTATTCCCAAACAAACATCATT
TTCTTTTAGAGAGCCTCTCTCTGTTTGTGATTTAGGTTGGTGATGTAAAG 
CAATGGCATAACTGAACACTGATTCAAAGAAAAGTGGCTTTTCTCTTTGT 
TGTATTAAAAAGAGGCCTTATAAATAGGATAGTAAGATTTGTAAGTTGAA 
CTTAAAGCATGAAGAAXATTTAG^

AATCCCAGCACTTTGGGAGGCCAAGACAGGAGGATTGCTTGAGCCCAGGA 
GTTCAAGACCAGTCTGGTCAACACAGACCTCATCTTTACTAAAAATAAAA 
AAATTAGGCCAGGTGCAGTGGCTCATGCCTGTAATCCCAGCACTTTGGGA 
GGCCAAGGCGGGAGGATCACTTGAGGTCAGGAGTTCGTGACCAGCCTGGT 
CAACACGATGAAACCCCATCTCTACTAAAAATACAAAAAAATTAGCTGGG
TGTGGTGGCGGGCACCTGCAATCCCAGCTACTCGGGAGGCTTCAGGCAGG
GGAATCACTTGAACCTGGGAGGCGGACATTGCAGTGAGCTGAGATAGTCC 
CACTGCACTCCAGCCTGGGCGACTCAGCAAGACTCTGCCTCAAAAAAAAA 
AAAAAAAATTAGTCAGGTGTGGTAGCACACAGCTGTGGTCCCAGCTACTC 
GGGAGGCTGAGGTGGGAGGATCATCTGAGCCCAGGAGGTCAAGGCTGCGG 
TAAGAGCTGAGATTGTACTACTGCATTCCAGCAGGGGCTACAAAGTGAGA 
CCCTGTCTCAAAAAAAGAAAAAGAAAAAGAAAATTATGTTTTTAAATTTA 
TAATTATAATAAATTTAATTACATAAATTTAAGCTCAAGTAATTGTAAAT 
ATTCTTTCTGTGCACATAAGTTATTCTTGTATTGACCCCACAGGAGCTGG 
CCATTCTTCAAGTCAGAAGGCCTGAGAGAGGAGCTGCCCAGSTGGTCTTC 
ATGGGGCTGTGCGGCCAGTCATCCCCCACAGGTTGACAATCCTTGTGTAC 
TTCATCCTCGTTGGATCCTCTGTATCCCTGACGATGAGCAACTGTGAGGC 
CCGTTTCAGCACTGAGTTCCAGTCAGGAAAACATCCACCCACCCACCACA 

CGCTCACACTTACACACACATTCACACATGCACACACGTTCTGGCTCCGA

AAAAGAAAAAAAAAAAGCAATTTAAAATAATTCTGATCCTTTGCxTATTT 

CCACAAACTCCATGAAAATTGTACATTGTCCAAGCAACATTTCTTAATAT 

TCTCTTTTTCTCTCATATCCATTTTCCTTACTGCTGTCTCCACCTTTCTC

TTCCAAACTCCCTGTTAAAATCCCTGCCCCAGCGAACTTTTATTCAATTT

TGTGGAATGGAGGCTGCTCTGATTTAAATTAAAAAAAAAAAAAAAATCCC 

TACTCCATGTCCCAGATCCCTAGTTGTTTTTTGTTTTTTGTTTTCCTGAG 

ACAGGGTCTTGTGTCTTCCATGCTGGAGTGCAGTGGCATGATCATGGCTC 

ACTGCAGCCTCAACCTCCTGGGCTCAAGTAAATCTCTTGCGTCAGCCCTC 

CCCAGTAGCTGGGAGTTCAGGTATGTGCTACCATGCCTAGCTAATTTTTT 

TCTTTTATTTTGTAGAGACACGGTCTTGCCAGGTTGCCCAGGCTGGTATA 

GAACCCCTGGGCTTAAGTGATCCTCCTGCCTCGGCTTCCCAAAGTGCTGG 

GATTACAAGTGTGAGGCACTGCACCCAGGCTGGATCCCTGCATTTTTACA 

GATTTAGCATCACAAAAGTCTAAACAATTAGACTGACTAAGGCAGAACTG 

CCCTTATGACAGCAGACATAAGAAGGAAAAGGCCAAAACACTGTGTTAAA 

AATTATCCAAATGTGAGGAAAAGGCAAAGAGAGTAGGTGTGCCTTTTTAG 

TGTCTAAGCTGCCTGCCCAAGGGGCATCTGATGCTCTCAGGCAGGAGTCC 

ACAAATTTTTTTTTGTAAAAGATCAGATAGTAAATCTTTTCAGCGTGAAG

AGCATGAGGTCTCTGTCACAAATACTCAACCACCATTACAACATGAAAGC 

AGCCAACAGACAACACATGACAAATGAGTGTGGCTGTGTTCCAGTAAATC 

TTGATTACAAAAACAGGCAAGAGGCCAGAGCTGACCCATGGGCCATAGTT 

TGCTGACCCCTTCTGTAAAGGAAAGTATTTTTGTTTGACTTGCTGTTTAC 

CATTGATTGAACACAAGGCTCTGTAGAGTTACTTGTTAACTTGCAGAAGA 

TTGATGAGTGGCAAGTAATTTTTATTCACCAGAATATANNATTATTCTGT 

TCAGTAGATAAGATAAACCCACTGTTATATTACTGTCTTGTTTAGAATGT 

GACTTTGATTCATTTTTTCACAAATTCATATTATTGCCCTAATTTGTATA 

TAAGTATGCTTCTTTTAAAAATATATATTTTTTAATAAATTTGAGACAGG 

GTCTCACTAGGTTGCCCAGCCTTTTGCTATAATGAGAGCATAAAGTGAAT 

TTCACACTTTAGCCTAGTGCATAGATGGGATTACAGGCACAAACCACTGC

ATGCAGCTAACTTTGCTTCTCATTCCAGCACGTTCTATTCCNNNGNTTTT 

CATATACGCGTCTCTTAATGC 

>Contig40

CGCATTCAGCCCAAGTTTTCTTCAGTGTTAAGGTTTTTGTTACTCTGTGC 
CCAAATGTCCTTCCAAAAAGGTTAAGTTTTTTTACCTTCCTGCCAACATT 
ATATGAAAGTGTCCACTTTTGTAGACTTTTACCAATGCTGACTACTTTTG 
GTTTCAAAAAAGCTCTCAGTAATTTTCTATTAATTACTTTTACCCTTTTT 
TATTGAGGGTGTTCAACTTTTTATTGTTAGCATATTCTCTCTGGGCTCCA 
TTGGACGCCTTGGCAGCTTTTTGGTAGTAGGTGCCTTTAGAAAAGTCCTT 
CTCGTCTGGCCCTTTCTGAGCAAATCTAGTGAACAGAATTGGCTCCATGC
TCAGCATTGCTTAATACGGTTGATCCAGGGCCTAGGACTCATTCCTTCAT
TACCATCCACTTGCATTGTCTTAAAGCAAGGCTCTATTAATTTAATTTGG
CATTTCCTGTCCCAGCTCVfTAGTTTCAT^

TCCCAGTAGATGCCTATGTTGCTTCCTTTTAAAAAATTTTGGAGCTGTTT 
CCCTAGCCTAACCTTTTCTTCAGGGCAGGAGTTAAGTCCCTTCTACTGCA 
TTCCTGTGAAGATGGTGATTCAAGAGGCAGGGCACCTGTTGCTTTGTGAA
ACAGTCCACTCTGCAGCTGGGCAGCTCTGTTACTAGAATGTTCTCCCTTC 
TGGGGAGCCAATATTTTGATGTCCTCTGTGAATCTCATCTGCTTATCCCA 
TCTGTTTATGTCCTTGAAGATGCACAGGTCTGACACCACGAGGTAGCCCT 
TAGAAATTTGATGGCATTTCTGATGTGTCCCCAACTCTTCTCCAACCACT 
CCTCCCAGAGCTTGTTTCTTAAGCCCCTTGTGGAGCTGATTGCTTTCCTC 

aaggcagctcagtttttcccagtttgctcctggtggtcctgaaatatgat 

tgactcctgaatactccaggtgtgaaggagagtgggggtggcctttctac 

ttgtcatggcctgggttttaagttgctgtccagtggagcagaggtgactt 

tcccagtgaactacattttttcccctctaaatccttagcaattttgtctc 

cagaggcaagacctggccaaaccatttgtgttgaggattgaatcaagaat 

gattgaggagatgacagtagtcccccctcatctgaggagggcgtgttcca 

agcccctcagtgaatgcctgaaactgtggatagtacccaactctatatgt 

ctatgattttcctataaattaatacatgcctgtgacaatgtttaatttat 

aaattaggcaaagaggccaggcgcagtggctcaagcctgtaatcccagca 

ctttaggaggctgaggcctcacctgaggtcaggagttcgagaccagcctg 

accaacatggagaaaccccgcctctactaaaaatacaaaattagctgggc

atggtggcaggcgcctgtaatcccagctactcgggaggctgaggcXggag 

aatcacttgaacccgggaggcgggatttgcggtgagctgagatcgtctca 

ttgcactacagcctgggcaacaagagtgaaactccgactcaaaaaaaaaa 

aaaaaaattaggcaaagaaagaaattaacaacaataagtaatgaaataga 

acaattctaacaatatactataataaaagttgtatgaatgtggtctcttt 

CTCAAAATTACCTTTTTTTTTTGAGACAGGGTCTCACTTTATTGCCCAGG 

CTGGAGTGCAGTGGCACGATCACAGCTTACTGCTGCCTCGACCTCCTGGG 

ACCAAGTGATCCTCCCACTTTAGCCTCCTGAGTAGCTGGGACCACAGGCA 

TGCACCACTGTATCTGGATAATTTTGTTTATTTTTTTTTGCAGAGAGAGG 

AGGTCTCACTATGTTTCCCAGGCTGGTTTTGAATGCCTGGGCCCAAGGGA 

TCCTCCTGCCTTGGCCTCCCAAAGTATTGGGATTACAAGCGTGAGCCACC 

ATGCCTGCCCCAAAATTATCTTATTGTTCTATACCCACTCTTCTTCTTGT 

GATGATGTGAGGTGATCCATTGCCTCCTTGATGAGATGAAGTGAGGTGAC 

TGATGTGGGCATAGTGATGCAGTGTTTAGGCTGATATTGGCCTGATGATA 

TGTCAGAAGGAGGGTCATCTGCTTCGGTGATCCTGGATCATAGAGTCATG 

ATGATGTCAATGGTTGGATGTCAGGAGCAGACGATGTCAATGACTAACGA 

TAAGCTGGACAGGTGGGATGGTGGCACAAGATTTTATCACGCTACTCAGA 

ATGGAGCACAATTTAAAACTTCTGAATTGTTTATTTTTGGAATTTTTCAT 

TAATATTTTTGGATTGCAGTTGACTGTGGGTAACTGAAACTGTGGAATGT 

GAGACTGTGGAAAAGTGAGGGAGTACTGTATTATGGAACTGTAACTCTAT 

TCGGTAGGGGAACAGAATTCACATTTGTGGGGCCCAGGTCTCTGCATCTG 

TAGGGATCCAATTGTTTCATTTCTCGTTGTAGCAAAAACTTGGCTTTGGA 

ATCAGACAGATTGATGTTTGCTATCATTCTAAATGGGTGCAGCTACACTT 

TCCTCAAGAGGTAGTTCTGAAAATTTAACAAAATGTGAATTTCTTGGTAA 

AAAAAAAAAACCTCAAAAATATTCAGTTTCCTTTCCTTTGTGTCTGATGT 

ACTCCATCAAATACTGGGAAATATGTGTCTCTCATAGAAATGTCATGGAT 

CTTTGTAATTCTGATTATCCACAAACCTTGGGGATTAGCTGTTTCAATGT 

TCCTATTTTACAGATAAGAAAATGGAGCCTGTGGTAAGTTAAGTGAGTTA 

CTCATGGCTACTTAACTAATATTTTACTAGGTGATAGGCCAGAGCTAGAG 

CCCAGGTCACCTTCTTATCAATGCTCTGCCTTGTCTCTGTGCCTTCCTGT 

CTGTCTGTATGTGTATGTGCCTGTTGACAGTAAGGCATAGTTTAACCCAG 

TAGAACTACCGGTTTGTAATGAATTCCACTTGTAAATGACTGACCATTCA 

AGGAACAAGTGTTTTTTCTATGCTTGACACCTGTTTTGGATGCCAAAAAG 

GATACAAATGTAACTTCAGACACTCTGGGCCTCATTTTGCACTCATTAGC 

ATGTCCAAAATTAAAAAGACTGACCACACCAAATATTGGTGAGGATGTGG 

AAGAACGGGAACTTTCATACACTGCTGGTGGGGATGTAAAATGGTACAAT 

CCCTTTGGGTAACAGTTTGACAGTTTCTTAAAAAGTTAGACATATATATT 

TACCATATGACTCAGCCCTTCCACTTCTAGGTCTTTACCCAAGAGAAATG 

AAATGCTGTGCTTTTACAAATGTCTATACAGGAATGTACATAGCAACCTT 

ATTTGTCATTGCAAAAAACAGAGACAATTCAACGTTGTCAAGAGTGAATG
GATGAGCAAGCTGTGGTs. 'JTCTATGCAr** GGTATCCTACTCAfiqOSp;

AAAGATATGGCTAAT 

>Contig41

GACAACAATGTCATGCATAAGATGACGATGGCCTGGGTGATTGATGCAAA 

CAAGGATAAAGAAAATAATCAATTTTGTCCCCATTTTCAAAGACAGATAG

CAGCAGCAAGAGTGTAAGTCTGAGGAAAGTCATATTCCTTCCTCCTACAA 

CATAGCACACACACTTACAAAAACAATACACAGACTCCTGGCCAATGGAC 

TTCAAAACTGAGGAGGATCATTAAATTTAAATGTTCACCGCTGCATGAAA

TCTCCCTGGGTCCTGCCCTCCCTTCCCCACCCTCCTCCACTTGGGCCGGG 

GCACAGCAGTGATTCTCTCACCTCTCAGAGTGAGCCAGTGTTGGCTGCAT 

TGAAGGCTCCAGATATGCAAACAGGGCAGATATTCCTGGACCAGGGTGCA 

CAGAGTGAGGCTCCAACGCACCCTATTAACTGCATGAAGGATGAATGAGC 

CTCTGGTATGGGCTGGGACAGAAAAAAGGATTCAAGGGGCCCAAAAGGGT 

TTGGGTGGAACCTACCAGGAGCGGCAGTACAGACTCCTTGGGAAGGTGGC 

CATGATTTAGCCACATTCACCAATAGGATAATCTGGAGAATTTCCTAGCT

TGAGTTTCTGGGAGAAAGCAGATTTCTGGATTATCTGGTGACAGGTAACA 

GGGCCGAGTTCATCCACAGCCACCTGCAGTGTTAGCACCTTAAGCTGAGT 

TCCTTGCACCAGGATGCTGTCACGCCCAGTCAGTGTGAGACGGTTCTTGG 

CTGAAGGACTGAAAAGCTTGGGTAAGTGACTTCACCTAAGCCTCTATCTC 

TTGCTCCCGTAAGTCAGGGCTCATTGTGGCTCCTTGCAGGCTTGACTTCA 

GGGTTAACAGAGAAAATGAAGGTACAAGTGCCTTGTGAACTCTGAAACTC 

CAAACCAGTCATTCTCAAAGTGCCGTCCACCAGTCTAGCACATCAGCATC 

ACTGGAAGCTTGTTTGAAATGTAAATTATCAGGTCCTCCAGAGCTATGTA

TGAATTAGAAACTCTGGGAATGGGGCCCTGCAATCTATTTCAACAGGTCC 

TCCAGGTGATTCTGATGCAAGTTAAAGCCTGAGAAACTCTGTCCTATACA 

AATGGATGTCAACTCAAGCTGCTCTTCAGAATCACCTATAGCACTTGTTC 

ACCCGAATCCCTGAGAATGGAGCTTCAGGACTGCTATTTCTCAAAGTTTG 

CCTGGTGATCCTGAGATGGGGTTTGGGGGACAGAGATCCAAGGTGCTACC 

AGGTGTGAGGAATTGTTAGAAGGCAAACCTGGCTGTCATCTAGGGTGCTT 

AAAGGGTACAGATCCTAGGATTCTGCCTCTTACAGCTGAATCAGACTTTC 

CTAGAATGGGATTGCTGTCCAATGGCATGCCTCCTGGGTGACTCTGATGT 

ATAGCCTGGGCTGGGAACCACCAGAGGATTATCTTCCATTGACCAAGCTG 

ACAAACTCGCTTAAGGCTCTGAGTTTCACACTTGATTTTCTAGCCCCTGT 

CCTTCCATGGATCACCTGCCCCCTTCCCTCCTAATCAGGAGCACAGTCAG 

TGGATGCACTAATGTGGCCTCTCCTTGGCTGCAGGGAACAGGTGGAAATG 

TGGCCATAGGTGTGCAGGGCTGCCTGCCATGTATTAATAGCTACAGATTT 

GAAAGATCCAAGGACAAGAGACTAGAAAAAAATTTAAAACAGCCAAGCAT

TGGCCCAGTAATGGCATTTCAGAAATCCACCAAAATATTAAGATGCTTTT 

TGAAAAATATCCAGAGCACTCATGTAAAAGTGCTTAATTATTAATAAAAG

CTGACATGTGTTGGGTACTTCCTGTGGGTCTGGCACTAGGCTAATTATGT 

TTTTAGGAGTTGACTCAAATGCTCCCTGTCATAATTATGTGAAAAAATAT 

AATTATTAGCTCCATGGTACAAATTAAGGAGAGGTTACATAAATAAAAAG 

GAATGATACTCAAATTAGTAACCAGAGCCCATGCTCTTAAACACTATGCT 

ATTATTTGTGGACTCTTACATAGGTGGCAAAAGTCAAAGGCTAGATTGAC 

TTCTGTCCACTTCCAGCCAAGATGAAGTACAAGATTCAGATACACCCTTC 

CGCATTAAACAACTTAGGAATCAGACAAAATATACAAAGCATTGTTTGTT 

ACACATTGGATAACAGACAGCACTAGATAGTCGTGTCTGAGAAAAGCGGT 

GAAATGAGCTGAGTCTTAGAATTGCCCCAGTTTACTAAGGGGCATAGTAA 

GGGCATAGCTGCAGCACAAAGAAGCAGAACCCAACAGAGACTGGCGTTCA 

CCTGAGTTGAGAAAACCAAGTTGAAAATTTAGGAACACTAACACAGATAT 

GTAGGCAAGAGTATCAGAGAGGAGACAGTTGTAGGGAAAAAGAGAGCTTT 

ACAGAGAGACAGCGAGAGCTCCAGAGACCCGCAGAAGATTGCCCTGACGT 

CACTAGCTGAGTACCGATCAGTGCATACATGTAAGGATATTACTCAATAT 

GTGGAAAAGAACAGAAGGAATGATGTCCAAAGCTCACCCAAAGACAGGAA 

TCATTTATGTTTCCACCAGCCAGAGTGGAACAACCTTGTAACGCATATGG 

AGTACTCAAACGAATATTTCCTCAATAATAAGTTCAAATTAACTGAGACT 

AAAGCCTGCCCGCTTTGTCTGGACATGCCTAACAAAGCTTTGAGGGAAGC 

CTCAAAAGAATGAAACCGTGTCCAAGTAATTTAACTGTGTCCCAGAAAAA 

AATTCAAGAACATTTAAATAAATATTAAAATATGATCAAACCCAGCAAGG 

TTAAATTCAAAATGTCTGGCATCCATTAAAAAATTACCAGCCTTGAAAAT 
tggcgggaaaatatta; -jataatgaa,. .jaaaaagcaatcaagagaj

aggcctagaaagtatacatatgataaaattagcagacattaaatotttat 

gattaatttattttatatgttaaagaaggtagagaagagcataagcacat 

taaagagagacaggaaagtcccagtactcacacagggccaggagcagttt 

tcaccagtcaggtgggaaaacttcatatttcatggagcattggtagagta 

cacagtgtcttgccttagtagagggataaatgctgttctgttcccgccta 

acccatcttgaaagaaaatctgaaaggatcaaactgtattcaagtaacct 

aatcacatcccagcacacagctcgactagttataaaaacacaaaatatta 

atatctagaaacacaaaaataatatctagcacccaacaaggtaaaattca 

caatgtctagcattcaattgaaattttctaggccatcaaagaagcagtaa 

aatatgacctataaggccgggcacattggctcatgcctgtaatcccagca 

ctctgggaggccaaggtgggtggctcacccggaggtcaggagttcaagac 

cagcctggtcaacatggtgagacctcatctctactaaaaatataaaaatt 

agcccagcatggtggtgggcgcctgtaatcccagctactcaggaggttga 

ggcaggagaatcgcttgaacctgggagaaggagaccgcagtgagccaaga 

tggcaccaatgcactgcagcctcattagagaacatcgggaag 

>Contig42

gaaactaaaggcttatttaaagcgcgagaccgtggcgcctttggactgga 
ccctttctaatgatcatttagtatcaggctatgtgggagttgaccgtttt 
gcatagcctgaaagccaacagtatcactcctcctctaggtgtggcagaga 
tgtgagagaaggagagtgagagtctgtgggtgtgtatggagtgttcgggg 
aagcgaggcacaggggacaatactgtggtgtagaaaactagtctaaggta 
gcatcaggaaattcatgaaaccaaaatgaatttcataacagcacaagaca 
ttatttgtttttgcctccctctcatttttttttttttttgaaacagagtc 
ttgctctgtcatccatgctcgtgtgcagtggtgcaatctcggctcactgc 
aacctccacctccagggttcaagcaattctcatgcctcagcctcctgagt 
agctgattacaggtctgcaccaccccgccggctagtttttgtatttttag 
tagagatggggttttgtaatgttggccaggctgccctgtcattttttttt 
tactagtgtccagtggagttttttaggggctacataacatgatactgtca 
ttaatctaatggctaatgaaagggatatgtatatgtttttgtgtttaaaa 
caaacttctttggggtcctcaataatttttaagagtataaaggggtcctg 
agatcaaagagtttgagttctgctggactgggacagtggttgtcaaccca 
gattgtacattagggtcatctgggaagctttaaaatagtactgatgccca 
accttaccgcaaaccaattaagccagaatctctgtggatgagaagtcttc 
attgtcatcatcaccatgaccatcatcattgtcaccgtcactacaccatt 
atcatcatcatcatatcatcttcattatcattgttagtatctccatcacc 
atcatcagcatcaccattattatcatcatcatcatccccaccatcatcct 
catcggaacttcacctgcatggaggacaatccactatgcattaggtgcta 
tgctatttgctatactccttattctcacaactgcccagagaggctgatat 
tatctcactttataacaggaggaatctggatcggaaaagttaaggtaagc 
taattcacagagcgagaagagatagagccaggattcgaaaccagttctct 
gctacatcaatgttcccagtccttgcactattgagaacctctttagttat 

GCTTTCACCCCTCCAACACCACAGTAAATTTTTTCTTTTTTTAAAAAAAT 
TATACTTTAAGTTATAGGGTATATGTGCATAATGTGCAGGTTTGTTACAT 
ATGTATACATGTGCCATGTTGGTGTGCTGCACTCATTAACTCGTCATTTA 
CATTAGGTATATCTTCTAATGCTATCCCTCGCCGCTCTCCCCACCCCATG 
ACAGGCCCTGGTGTGTGATGTTCCCCACCCTGTGTCCAAGTGTTCTCATT 
GTTCAGTTCCCACCTATGAGTGAGAACATGTGGTGTTTGGTTTTCTGTCC 
TTGTGATAGTTTGCTCAGAATGATGGTTTCCAGCTTCATCCACGTCCCTA 
CAAAGGATATGAACTCATCCTTTTTTATGGCTGCATAGTATTCCATGGTG 
TATGTGTGCCACATTTTCTTAATCCAGTCTATCATTGCTGGACATTTGGG 
TTGGTTCCAAGTCTTTGCTATTGTGAATAGTGCCACAGTGAACATTCATG 
TGCATGTGTCTTTATAGCAGCATGATTTATAATCCTTTGGGTATATACCC 
AGTAATGGGATGGCTGGGTCAAATGGTATTTCTAGTTCTAGATCCTTGAG 
GAATTGCCACACTGTCTACCACAATGGTTGAATTAGTTTATAGCCCCACC 
AACAGTGTAAAAGCATTCCTATTTCTCCACATCCTCTCCAGCACCTGTTG 
TTTCGTGACTTTTTAGTGATTGCCATTCTAACTGGCACCACAGTAAATTT
TTATAGATTTTATAAGCAAATTGTATTTACTGTGCAAGAATTGGTTTATT 
TTTTAAACCATGTGTTGCAAACATACAATGGTTAATTGTGATATTTGCTC 
AGTACAAGATCATCAGATCACTACACAGACTTGAGGTAATTCCACCTAAA 
AGCAAAGAGAACTGACCCCACATTAACTGAGAAGTCTTTACTTATTTA'i. j."

CCCTATAAACGAGCCAATATGAAGAGAAGGCCTTAATGTGGTTAACTATG 

TAATTTTTTTCTGACTTTTTGAAATACTGAGAAGAGCTCATGACTCTCCC 

ATCTCCTAATTCTACCTTGGTGGATTTTAGACTGACCACAACTCATGGGT 

AAATGAGGGAAGACGAATAAGAAACCTTGCTTTTTTTTCCTCCTTGTTTT 

TGGCTGGCTGCAGTGGCTCACACCTGTAATCTCATCACTTTGGGAGGCCA 

AdGTGGGAAGATCACTTGAGCTCAGGATTTCAAAACTGGCCTGGGCAACA 

TAGTGAGACCCCATCTCTAAAAAAAAAAAAAAAAAAAAAAAAGGCGACGG 

GCGGTGCGTGCCTGTAATCCTACCTACTCAAAAAGCCGAGGTGGAAAGAT 

CACTTGAGCATGGGAGGTCAAAGCTGCAGTGAACCTTGATTGCACCACTT 

CATTCCAGCCTGGGTGACAAAGCAGGACGCTGCCTCAAAAAAACAAAAAC 

AAAACCTTAATTTTTTGGCTATTCTTTTCTGGTAAGAATGGTATAGAGAT 

GGGGATGAGGATGGCTATTGTATGAGAGAGCAAACAGGGTCCAAGCAGTG 

CTCTGGGCTGTCTAAGGACCAGTAGTCAGCTTAACTTCTCAAATTTCCAG 

GGAAGGAGTTCGGAGTGGTAGAATATCCTGGGTATGCCCAAAGCATCACC 

TTGCAAATAGCCTGTCATGAATAATTTGTTTCATTTGTTATGACTGGAAA 

CTGGCTTTGTGTATGCCAGAGAATGGGGGCAGGAAAGAGAGATTGGTGTC 

TTGAGCTCTCTGTGCCTCTGGGGCAGTGATGCTTTTCCTCTCATGTGGAA 

GGAGAGCATGACTGAAAAGGTGCACAAATAAGGTGTCTGTGAGAGAAATT 

AACC^CCAGATACAGAGACACAAGCTTCCCCAAGAGGTCCTCATTGCTC - 

TGCCTTTTTTCCTTTTTTTTGCTTGTTCTACCATTAATAACAGAAACTGA 

TTATGACCTCAAAAGAGAGGAGAAAGCGACTCTCCCCACCCTAGAGCTAG 

TTAACCACCATATCTTCCTAGATCTCAGTTCAAGAGTCACTTCCATCCCC 

AATAAAAGCCCTTGAGTGCTGAGCACCTCTCCGTCATAGCATTTGTCCTA 

GGGGTTTTTGTACATTTTCTTGTGTGAAACTTGGGTTGACATCTGTATTT

CCGACTAGATTACAGTTTCCTCAAGGGTAGGGATGTCTTGCTTGCCATTT 

TCAGTTCCAGCATCTAGACAGTACCTCAAGCAAACAAGGCCGAGGGGGGT 

GCGGATCACGAGGTCAGGAGTTCGAGACCAGCCTGATGAACATGGTGAAA 

CCCCGTCTCTACTAAAAATATAAAAATTAGCCAGGCGTGGTGGCAGGTGC 

CTGTAATTCCAGCTACTCAGGAGTCTGAGGTAGGAGAATCGCTTGAACCC 

GGGAGGTGGAGGTTGCAGTGACCTGAGATCCACTGCACTCCAGCTTGGGT 

GACAGAGCAAGACTTCGTCTCaVAAAAAAAAAAAAAAAAAAGAAAGAGAAA 

AGAACATCAAATGAATGAATGAGTGAGATGAATGAGTTAGCAGTGTTGGA 

TTTAAGTGTCAGATTCTTCCCAGCTTGACTTTTTTCTTTGGCTTAGTGAT 

TTTGAGGTCNCAAGATTTATTTTCCTTTCACAAAGGTGATCACTACCATA 

AGATCTTCAGAAAAAGAATGTGGCAAGCCANGTCTCACTAATGCAAATCT 

CTATAACAACTGTATCAGTACT 
>Contig43

GAGGTGTCATAAATATGGACCGATAGATGAATACAGGTAGGATGGGACAC 

AATCTAAGATCCCAGGGGGGGGAGACCACACGCTTGGTTAGGGAGACCCA 

AAGTGGACCGTGTGGCCAGAAGAGTCCCGCACTGCACTCTAGTGACAGTG 

CAGAAAGTCACTGTGGGAAATCTAGAAGTTTCTACAGGTTGCTATTTCAT 

CATAGCACTGTGCAGGCCAACCCTTCCTGCTCCACTGGCTGTTGGGAAAA 

GCTTTCTCTTTTCTTCCTAGCCAGGGAGCTCTCAAAGTGTTCCACTCTCT 

CACCTCCACCCAGGCGTCCAGGTGTGGAGGACACTTGCCGGCTGCTTGTC 

TGCTGACTCATCCCTTGGTTTCACTTGGAAAACCTACCACCAGCTGGCCT 

CTTTCCAAGCATCAGCCTCCTCATTTTCTTAATCCCTTAGGTGTGATCTC 

ACCTCCACACAGTAGATTGCCTCAAGGCCCAATTCCAATATGAATAAAAA 

TGATTATTTTGTCATCTTCCAATCTTCCTTTTAAAATATTATTTTATAAT 

TCCCTTTAGGAGGATCACCTAAGTGAAGACTATTTTTACCTAAGAAATGT 

TAAAATGTAAAGACATGGTTGTAATCTGGGGATTCCTGTTAAAATGGCTA 

GCAGACAGAAGTCAGACGACAGGCTAGAAATGTGTGAAGAGTGGTTGCCT 

TTGAAAGGCGGAGTTGGTAATGATTTTCTTCCATTTTTCCATGCTTTCCA 

ATTCTCTACAAAGGCCTTAATATTACTTCGATAACCAGGACCTCTGATAA 

CCTGCCCCCACCGAGTAAAGACTTAGCTGGGAAAGTCAGCTTCATGTGAG 

GTAAAAGGAACCAGGTAATACACAATTCCCACTGCCAACTGTCGGGTGTG 

CAGGCCTGAGCTTCCTGCATGTGGGAGGAAAGAGAAAGAAGAGAGAAACT 

CCAAGATCCAAGAGATCCAGCAAGAAGGCTGGAGTCTGAGGACGCAGAAA 

GCTGAATGGCACAGTTACCACTATTGTGCTGAGGTTCTGTGGCCTCTGGG 

TCTCTTGACAACTGGGCAAAGACCCACAGAAAACTATCTCTAGACCCTAC
CTGTGGGAGGGGAAAG1 _ JTTCAGATCAx CTACAGGACAGCCACCTGGk--'
CTCAAATGGCTTACAGTTCCTTCATCCAGAGGGTCTTCATCTAGTACATA 
CCAGGTGCTAAGCCTGGGTGCTGGAGACATGACGGGGAACCCATTTACCA 
TGGCTTTGTTACTGTGAGATTCACATCTAGGGAAAGCCAGCAAAGGGGAG 
GGATCGAGGAGAGCTTGTTAGGCAGAGAAAATACCCAAGGGCAAGGGAGA 
AGCCAGCCTGTTCTGAGCACACACAGTGGTTCCATCTAACTGGGCCTCAG 
TG£CAGGTTGGACTGGAGATGGGGCTGAGGAGCTGTGACAGAGCATTCTG 
GACACAGATGTCACATAGTCCCTTGAGGTTAGGGTCCTTAGGCATGGCAG 
CATTGCTTTGAGTTTTTCCTTTTGTAATGTTGCCATTCATGACAATGTGG
AAGATGGGTCCTTGCAGAGAAGGGCAGGGCTGTGAGACCAGTTAGGAGAC 
TAAGATGTGAGCCAAGGAAAATGAGGAACACCTGAACACTGGGGCAGGTG 
CAGGGCCCAGAGAGAAGCAGATGGCTTCCTGAGGTTTTAAGTAGGTAGAA 
TCAAGGCAGCTGGTAAAGATCTTTTATTACATATAAACTGGAATAAGCCA 
TCTGCTCCAAGACAAAAGAGTAGGCGGAAAACAATACAAGACAGAAATGG 
AATTAGAACAAACCTGGGAGGAATGTGGAATTAGAGTAGAGAGTCCAACA 
CTGGCTGCAATCATAAAAATGTAAAACAAACAAAAATTTGCSAGGTGTGC 
TTACTTAGAAATAATTAGCTGTCATATTAAGTTCACTTGTGTTATGGCTT 
AAATGTGTCCCCCAAAATGTGATGTGTTGGAAACTTGATCCCCAATGCAA 
CAGAGTTGAGAGATGGGACCTTTAAAAGGTGATTAGGTCATAAGGGTTCT 
GCCCTCATAAATGAATTAATACTGTTATCATGAGAGTAGATTCCTGATAA 
AAGGATGATCTCTGCCTCCTCCGCAGAGCCCTCTTGTGCATGCTTTCCTG
CCTTTCCACCTTCTGCTATGGGATGACACAGCAAGAAGGCCCTCACCAGA 
TGCAGCTCCTTGATCTTGGACTTTCCAGCCTCCAGAACTGTAAGCCAAAC 
AAATTTCTGTTTATTATAAATTACCCAGTCTCAGGTATTCTGTTCTAGAA 
GCACAAAATGGACTAAGATCATTAGATTATCATTTTTTATCAGACTGTTG 
AAGTGAAAAATAAAAATCAAATAAAGAAATTAAGAGAGCTGCATGCAGCA 
GCTCATGCCTATAATCCCAGCACTTTGGGAGGCCAAGGCAGGTGGATTGC 
CTGAGCTCAGGAGTTTCAGACCAGCCTGGGCAACACGGTGAAACCCTGTT 
TCTACTAAAATACAAAAAACTAGGCCGGGCGCGGTGGCTCACGTCTGTAA 
TCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCATGAGGTCAGGAGATC 
GAGACCATCCTGGCTAACAAGGTGAAACCCCGTCTCTACTAAAAATACAA 
AAAAAATTAGCCGGGCGCGGTGGCGGGCGCCTGTAGTCCCAGCTACTCGG 
GAGGCTGAGGCAGGAGAATGGCGTGAAACCCGGGAAGCGGAGCTTGCAGT 
GAGCCGAGATTGCGCCACTGCAGTCCGCAGTCCCGCCTGGGCGACAGAGC 
GAGACTCCGTCTCAAAAAAAAAAAAAAAAACTAGCCAGGCATGGTGGTGT 
GTGCCTATAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATTGCTTGA 
ACCCAGGAGGTGGAGGTTGCAGTGAGCTGAGATCATACCACTGCACTCCA 
ATCCAGCCTGGGTGACAAAGCAAGACTACATTTCAAAAAAAAAAAGAAAG 
AAAAAGAAAAAAAGAAAAGAAAAAGAAATTAAGAGAAGGGCAGGTATTAA 
CCCCAAATATCCCACCATAGGGACACATTAAAGTTTGCTTGGCCACTCCC 
CTAGCATAATATATGGAATGTCTTCAAGGACCCTCTGTTGTAAATACAAG 
GCCCTGCTGGACTTAATACAACCTGCAGGCTTTGAGATCCCTACTCTGTT 
GCCATCTCTCATAGGATTTGCAGACCAAATCCAAATACTTAAAATAGCAA 
CACTCACAAACATGCAAATCAGAGCAGAAAAGAAACTTCTAAAAGGCCCT 
GAAACTACACTTTATGAGAGAAGACAATAGGGACCTGAGGGTGGTAGAAT 
TTTCTCTCTATGCATCTATGTTTCCAGGGCfCACTTTCTCAATAAACTCT 
TAAATTGCTTTTAAAGTAAGGGAACAAGCAAACATTACATTTAAGAGAAA 
TCAATTTCATAAAGAAGGGGGGATGTCCAGGGTACTTTGCTTCCATGTTT 
TGCTTCCATGAATTTGTGTTTAACAGAAGATGCAGAAAAACACACAATTA
TTGCAAAATCAAGGAAATCCACTCTAAACATCCCTTGGTTTCCCAGGCCA 
GTGTCACAACTGAAAACACATATTGTGGCTAATTATGTGTCACAAATTAG 
AATGACAAGGCAAGAAAAAAAAAACTCTCTGATTAACTAATAGCAGCCAA 
CACAGACAGCCTGTGTAGCTCGACTCTGCTGGTTTATAAAAGGCAGAAGA 
AGCAAACGGCTTCTGTGACCGCAACAGGAAGGGCCTCTGCTCTTAATAAA 
TAAATAACATTTAAATTATTCTCCCCCATTTGCAAAGCATTTTCCAACTC 
ATTATCTCATCTGACCAGGTATTATTGTATCTGACCAAGAACTTGTATAC 
NAAATAAAGAATAAAAAATAAATATGGGCCANGCACAGTGGCTCATGCTT 
GTAATCCCANCACTTTGGGAGGCCCAGGCGGGTGGATCACTTGAGGTCAG 
TAGTTTGAGACCAGTCTGGCCGACATGGCGAAACCCCGTCTCTACTAAAA 
ATACAAAAATTAGCCCGGCATGGTGGCACATGCCTGTAATCCCAACTACT 
TGGGAGGCTGAGGCACGHGAATTGCTTGAACTCGAGAGGCC^GGTtGCA 1

GTGAGCCGAGACTGCGGCCATTGCCCTCCAGCCTGGGCGATGAGAGCGAA 

ACTTCATCGAAAAAACAAAAACAAAACAAAACAAAAAACACCTTAGAAGA 

AGCGTTCCTCCTCTTGCTTTCTGAAGACACTCTACGCTGAAACAGTAACT 

TTCAATAAACCATCTCTTCTCACCGCACTCTGCGACTTGCCTTGAATTCC 

TTTGTGTGCAAGATCCAATAAGCCTCTCTTGCGGTCTGGATGAGAACCCT 

TT^TTTGGAATACTCTGACACAACAAATTGCAGAAAGAAAGTCTCACATG 

TATAAAATAAGCAAAAAGATTCTCTGGCATCTGAAGAAACAATTTCCTTG 

TCAATATTAGTATCACTATAAGTGTAGAACAACCTGTTGTATGATGCTAC 

ATAAAGTATATGAATCTGAATACTGTTGGATACAAAGGGAGACTATNNAA 

TGTAATACGTCGCCCGAAATGACTACACTGTTGGTGATCTTTCTTTCAAG 

AAGCANAATATTGCCTCNAACATCCTGTACATGGTATAAAATTTTA 

>Contig44 

CCCAGCAAGAACACCAATACAACGGGGGGGGCGTTCTTTGTGAGGGGTGG 
GGAGGTCAATTTTTTGGAACCTGCAGCAGGTAACACACAAAACTTCCACA 
GCTGCTACCAGCTTTCCAGGAGAGCCTGTGTACCTGGAGAGGAGAAGGCA 
AGTGCTTCCGAACTTGACTTGATGTCTTAGATTCTGCAATGCGTAGTCTG 
TAGGGACAGGCTGTAGCTTATCCTATAGGCTTGGGCTGGAGTCAGCAAGC 
ATCTGGGCTGGCAGAAGATAAAAGATGCAAAGGTGGAGGAAAGCATACGT 
GGTCTGGAAGACAGACTTGGTGGGTGGGTGGCTGCTACAACACCCTAGTT 
AGAGGTAGAGGGGTAAGTCAGTGTGTCTTCTGCACAGGCCTCTTCCCCAC
CTCATTCTTCATTTCCCATACAGCCTTGCTGAGTTATTCACAAACATCTG 
ATTCAACTGGAAGCTGGGTTGAGGATGACCTAAAGGACTAGTGTGATGCC 
TGCCCAGGGGTGTGGGCCCATAGTCAGAGTCCAGAGCCTCCTCTCAGCTT
TTAGCACATCTCACCCACATCCTGGGTCCTTAATTAGCAATATGAAAGCA 
AGCCAAGTGACAAGATTTTGTCCCTGGGAAGTCCAGAAGCACTCCTTTTC 
TCATTTGTATAAGCATAATGATTTGCTTACATAAATAATCATGAAAATTC 
AAATCCCTCTCAGAAATCAGGTCATAAAACCATGAAATGCAGCATGTGGG 
CAAGAATCACAGGGAAAGGTAGGTCTTGGAAAAGAAAGGATGGCAGGGAG 
GAAGAAAGCAGGGTGCCAGGGGCCCTGGGCTGCTGTCGAAGTCAGGTGGC 
TCACCGTCTCTGAGAACATTTCACTTTCTGGTAAATGGGGCAGTTGGAGA 
TAGAAGGGTTGGGTGAATGCCAAGAGTGAGCACAGCTGAGGTCAGTGCTG 
TGCCTGCAGTCCAGGCGGGAGTAGAAATCCTGGGCCCATCTTACCTCCGA 
CCTCATTTCCTCCTCTGTAATAATGTGGGGGTGGGGGAAAGTTCTGGTCA 
TCAGCCCTAGCATTCCATGGTTCATTTCCTCATCAGTGATGGAAAATCAC 
CAAGCAAGAGAACAGGATGGAGAATAACCGGATGGGTGCAATCGGAGGTG 
CTATTTCAGGTGAGGTGGCCAGGGAAGGCCCTCTGAAAGGGTGGCTTGAG 
CAGGTGGCTGAATGTACAGAAGCTGCCAATCATGAAAGATCTGGGGTACA 
GCATGCCAAGCAGAGGAAATGCGAGTGCAAAGGCCCCGAGATTGGATGTG 
GGCTTAGCACAAATGTGGCATGGCAAGAAGGCCAGTGTGGCTGAAGCAGC 
ATGAACAATGGGTGGAGGGGCTGAGAGGACAGAGGAGCAGGAAAGAGCCA 
GGCTTGGGTAGGAGAGGTGTCAACTTGATATATGATGCAAAGCCCTTGGA 
GGTTCCCAAACACAAAAGCAATGATCTAATATATGGTTTTAAAAATGCCA 
CTCTTGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAG 
GCCGAGGCGGGTGGATCATGAGGTCAGGAGATCGAGACCATCCTGGCTAA 
CAAGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCGGGCGCG 
GTGGCGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAAT 
GGCGTGAACCCGGGAGGCGGAGCTTGCAGTGAGCCGAGATTGCGCCACTG 
CAGTCCGCAGTCCGGCCTGGGCGACAGAGCGAGACTCCGTCTCAAAAAAA 
AAAAAAAAAAAAAAATGCCACTCTTGCTGTGAAAAATTGACCCTGGGGGA 
AGGAGGAGTAGAAATGTCAAAAGTGGAAGCAGACCACTCAGGAGGTCAGG 
GCAATGGACTGTGCAGGAGAGACTGACATCTTAGACTCGGGCAATAGGAG 
AGAAGGTGGTGAGGATTATATTCTGGGCATAAAGGCAACAGAACTAGCTG 
ATGGCGTCAACGTAGGAGATGAGGGAAAGAAAGAAATCAAAGGGCATTCA 
TAGGTTTGAGGGTTGAGTAACTGGGGATATTTAACAGAAATGGAGAAGTC 
TGGGGAAGGGGCAAGTATTGTGGGGGCAGGGGTCAAAAGTTCTGTATTTT 
GGCCAAGTTAATTAATATTTGAGATACCTCTTAGGTGTCCAAGTGAAGAT 
GTCAAACAGTCAATTGAATACAAAATCTGAATCTTAGCCCAGGATGGTCT
CACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGAGAGGATCACTTG 
AGGCCAGGAGTTTGTGATCAGCCTGGGCAATAGAGCAAGACCCTGTCTCC 
ACACACACACACACACA. AAAAAGTCA'i^CAGGCATGGTGGC^GATGC^. ,
GTAGTCCCAGCTACTCAGGAAGCTGAGGCAGGAGGATCACTTGAGCCCAT 
GGTTCAAGGCTGCAGTGAGCTATAATCACATCACTCAATACTACACTCCA 
GCCTGGATGACAGAGAGAGACCTCATTTATTAAAATAAAATTTAAAAAAA 
TTAATTAAAAATAAATCCAAATCTTTCCTGAGATTCATATTCAGGAGTAA 
CTGTCATGTAGAAGGCATATAATGCCATGGGTCACATGATACCATCTAAT 
GAATGCCACTGGAAAAGAGAGAATAGCTAAAAACTGAGCACTGGGCACAC
CAGCACAGTGAGGTTGGAAGGAAGAAATGGAGCTAACAAAGGAGACAAAA 
GAGGAGTAGCCAGTGAGAAGAGAGAAACATCTGGAGAGAAGAGAGAGCAG
CAAAAGGTGGGTGAAGGAGAATGTGGTCCACCAGGCCCAACAATGCTGAG 
CAGTTGAGTAAGTGAGGACCTGGCCACTGAATTTGGCAAGAAAGAGGATG 
TCAGCGGCCCTAGAACAAAAGTGAAGAAGAGCTTGAGGACGGAAGCCTGA 
CAGGAGTGAACTGAGGAGAGAATGAAAGGTGGAGACATGGAGCCAAGGAG 
CACTGAGACTCCCTTGAGTAGTTTTGCTGTAAAATAAAAGTGAGTGCAGA 
GACGGGGCAGGGGGACAGAGAAATGCAGGGGTAGCTGGAGGGAGCCACAG 
AATCAAAAGAGGGTTTTTGTGTTTAAGATGGTAGTTGTCACATAGCACAT 
TAGTAAGTTCATGTGAATCACAACGTAGGTGAGACAGATCACTAATGCAG 
GAGTCAAATCCTTGCAGAGCCCCCAGAGGAGGTGATGAAGGGAAGTGATG 
GACATCATTCAGATGCAAGTAGGTTAGCAATTCCTGGGGTACAAATAGGA 
GGTGACTCCTTTCTGATTGCTCCTGTTTTCTGAATGAGATAGCACATAAA 
GTCCACTCAGCCATGTTAGCTGTTGAAGTCCTTGTGGCTGTCATGCCTGT
ACAGACTGGGCTCTCCTCTCCAGCATTTCCTCTCAGACTAAGCTGAGCTG 
CACTAGCCGCTGCCACATCCTCTTGGGGCCATCCTCTGCCACACTCCACA 
TATTGCTGTGGTTTGCTTGCAACCCCTGGAAGGTCCTACTGGCTGCTCCT 
AGAAGAGTCTGGGCGGCATCTCTCCCTTACTCGTTATCACATGGTGCTGT 
AAGCAGTGGCCACACACTTTAGCTGGTGGGATGGGCCATCACAGGCAGTA 
AATGCGAAAGACTGCTCAGATTTTAAAGCACCCATGAATCAGTAGAATGA 
GTTTAGAATTGTAGTCATCAACACACATTAAAAAAAAAAAAAACAGGCAC
TAAAAAAATTAGTTGAGTAGGATAAAGCCATAAAAGATATTAACTACAAC 
CCAGATAGGAGGTGCAAAATTGTCCTTACATAAATCAGATGGAAAAAGTT 
GAAAGCAGATAAGATAAAATAGGTAAGCATGACATTTAAAAGGTATTCAT 
GGGACGTGGTTACAAAACCAACTCACAACTAAAAAGTCTTAGGACCTCTC 
GCTGACTTAGGAGCCTGATCCCAACTTTGAGAATGACTCAGTGTGTTACC 
CTGTGGCTAGTGTAGACCAATGATCCTGTCTCAGAGTCACTAGCCAACAG 
CCCATATCAAGTAATTGAAACTTTGACTCAGAAACCTCAGTGTCAGAACC 
TTTGACTTAGGAACCACCTGTAGTGGTTAACTGCAATTTGCACCCCTTAG 
TTCAGGGCTTTACAACACCGGGGGCGGGGAGGGGGAAGGCATAGAGCTGA 
TGACCTAAAGGAAACCCATTGCAGCAACGCTTTTGTGTTAAGTTTACAAA 
TAAGTGTTGTTTTAGAATCCTCCAGGTAATGCCTTTGTTATTTAATGTGT 
CTGAGACAATTCTGCACATTAAAGAATATAAAATATTACCTTGTAATTCC 
AATTTGAAATGTGTAATTGACATTAGACTTCTATTTTAATTTGAAATGTC 
TAAAACAATGTGGTTAAGTTTGTAAAAGGTGTGTGAATTTTGAGTCTGAT 
TTACTACATTTTTTTTTAATTTTCTTTTTTTTTGGAGTTTTAGGGATTGC 
TTAGATGGCTAGAAAGATCGCTAGGCACATGTCC 
>Contig45 

GATGTGTGTACGTGTGTGCAAATACCGTGCCTTTTTTGTTTTCTTTTGTT 
GAAACAGAGTCTCACTCTGTCGCCCAGGCTAGAATGTAGTGGCGTGATGT 
CAGCTCACTGCAACCTCCGCCTCCCAGGTTCCAGTGATTCTCCCGCCTCA 
GCCTCCCAAGTAACTGGGATTACAGGCGCCCACCAACACGCCCAGCTAAT 
TTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTC 
TCTAACTCCTGACCTCGAGATCCACCCACCTCGACCTCCCAAAGTGCTGG 
GATTACAGGCATGAGCCACCATGCCTGGCCAATACTGTGCCATTTTATTA 
TCAGGGACTTGAGCATCCATGGATTTTGGCATCCATAGGGGTCCTGTAAC 
CAATACTGCACAAATACCAAGGGACAACTGTATTCTAAAAAGACCAAAAA 
TTAATAAGCAGGACGCTGAAGGTAATTGCCCCAATAAAGTCATGATCCCT 
TGCCCAGTGTCTGAACCTCAGCCAGTTTTCATACTCAGGACCTATTGGCT 
GCAGAGGTGGTAGGAACCATATGAGAATCCTGCAATATCATGGCAAGTAT 
GCACTTTAATGATATCTGCAGTCCTTCCCCAAAAGGACCTTACATTTACC 
ATACTGCTATGTCCTGCGTGAGAGGGTAATACTCAGATTTTTTTTTTTTT 
TTTTTTTACACAACGTCTTACTGTGTTGCCCACACTGGAGTGCACTGGCT
CGATCTTAGCTCACTGC .CTTCTGTT1 - JTGGGCTCAAGTGATT§flf'

GCCTCAGTTTCCTGAGTAGCTGGGATTACAGGCGCCCGCCACCATGCCTG

GCTAATTTTTGTATTTTTAGTAGAGACGGAGTTTTGCCATGTTGGCCAGG

CTGGTCTTGAACTCCTGACCTCATGTGATCCGCTGGCCTCCCAAAGTGCT

GAGATTCCAGCGTGCGCGGCCATACCCGGCCGGGAATTCTTTATATATTC 

TGAAAACTAATCCTTTGTGAGACATAAGTGTTGTAAATATTGTATCCCAG 

TTTGTGGCATGTATTTTTAATTTTTAATGGTGTCTCTCAATGAAAAAAGC 

TTAACACTTAAATGAGGTCAAATTGATCACCTTTTTATTTATGGTTGATT 

CCTTTGGTGTCATGTGTAAGGAATGTTGTTCCTTCCTGTCCCAAAGTTGC 

AAAGATTTCTTGTGTATTTTGTCCTAAAAGTTTTAAAGTTTTGCTTTTCC 

CATCTGTGCACATTTCACATTTGCTACATCTCACTGACTGCTTCCTCTGC 

TGCAGAGCAAGCTCCATGAGAGCAGGAGGCATGGGTCCTGCTTCTTGTTG 

GTCCCCAGAGCCCTATGTCATGACTAGGACCTGGCAGGGGACTAGTGAGT 

AGCTCCTGACTAACTGACTCAATGAATGAATGATTGGATGATTGAACAAA 

GTGGTATGGGAGTTCACAGCGAGTAAGAGATGCCTTAGAAGAGATGAAGA 

AGGAGATGGTATAGGGTAGTGGTTCTCAATTCTGGGTCCATGGTGGACTC 

ACCTGGGGACCCTTAAAATGTACCGTGGAGGATCCCAGCCCAAGAGATTC 

TGTATGACTGGTCTAAGATGTGGTCTGGGCACCAGGTGATCCCAGTGTGC 

AGCCAGGCCTGAGGCCACTGGATTTGGTGGTAAATGAGGTAACTATCAAG 

GGTACAGACGTTGGTTGCCAACAGGCTTGGGCTTGAATTTAAGCTTTGTC 

ACTGACTTGCTGTGTGCTCCTGCACTCGTTGASCCTGTTTTCTCAGCTGA

GAGATGGGTGTGATAACACCTACCTGCTGTAGTTGTTGTGAGAGTTAGAG 

GAGATAAGCATGTTCCTGGAATGAAGTGTGTTCTTAATCCATCATAGGTT

TTTTGCTTGTTTGTTTGTTTGTTTGTTTGTTTTTTCCTTTTCAAGAATGA 

GGTTGAGCCAGACTTTGACAGCTGGGTGGGAAGTGAACATGTGGTGATTG 

GGAGAGAAGGGCAGTTTATGTGAAGGGAATGTAATAATTAGAGAGTGGGC 

GTGGGAAGACATGCTGGGGAGAGTGAGCAGGCCGGTTAGCCCTGGTAGAG 

GGTGCAAGAGAGCAGTGCGGAATCTGCCAGGGAGACAGGTGGGTGACCAG 

GGTGCCAAGGGTGTGGCTTTTCCCAGGTTCCCATGGACACAGCCATCCTC 

CCAGATGCCCAGCCTAGCTGTGAGTGAGCAAGAGTTCTGGATTGTCTCTC 

TCACTCTGTCTTTTTCTCTCATTCCAGAAACAAAGCAGTGACTGGTACTT 

AGGAGGAGAATCAGGTCAAGTTGGGAGAAACTTGCTTCTGCTCAGGGGAG 

CAGAAGCAAGAATGGAGGCCCCACCCATGCTGGAAGATGATGAGGGTTTT 

GGTTCAGGGAGGAGGAATATTGGGGATCTAAAGGGGCCTGGGAGTGGGGC 

AGGACCCTGCCTTAGGACAGGTAGAAACATTTTCTATAAAAAATGGGGTG 

GAGGTTGATGGTAGGACCAGGCATCTTTAGTTGGCTCCCTGGAGTGTCAA 

GCCCTTGAGATGGTCTTTAAAAGCCATGCAGTGGGGTTTGAATCTGGTGT

TCAAGCTCATAGGTTATTAACATAATGACACTTGGAAACTATTTGGGAGA

GCTCAAGTGAGTGGCCTGGAAGTTCTGTGTTGGTGCAGGAGGTGACTTAG

GATGTGCTGCTCCAGACTCATATCTTTGACTGCACACCTGATGCTTCATC

TGGCTATCCTGTAAGCACCTTCAACTTAACATGTCCTACACAGAACTCTT

GATATTCCTGTTCCTCCCCCAGTTCCTCAGTTCTTACCAAATGTTCTTCC

AGTTACCCAATTGCTCAAGTAAAAAATCTAAGTCCTTCTCTTGGATTTCT

GCCTGTTCCCTCAACATCCCACCTATCCATGAGTGTTCTGTGGGCCCTGC

CTCTGAAATAAATCCTGCCTTTGTCTCCCAGTTCACTCCAGCCACCCATC

CTGGGGCTGCACCCTCCTCCTTCCAAGCCCTCTCCCTTTCCTTCCTGGTG

CTGCCTGTCATGTCAAGCATATGCATCAGTGCGACCAGGACATTTGAAAT

GCAACCAGTACAATTGGGCGCGGTTATGCCTACCAGTTTTTCTTCCTTAA

ACATTTTATATTTATGTTTGAAAGCATGCCACCTTTCTTCACTTGCCAAC

TTGACAGATTTATTAGTTGACAACATCCGCTGATAGCATCAGTAATAAGT

TAATTGTTTTTGCACATGTAGCTTTAATTATTCTCATTATCATTTATAGG

AGTTATTCTTTGTAAAGGGTAACTGAGTTTTCCAAAACAAACAGAAATTT

GGGGTGGGCCCATGGAGCGTGACTCATGAAATCAGATTCTTAGAAGGACC

TCGGCAAGTCTCTGGGTTGCTGTTAATGAGCCTGGCTGGCTGCCAGGGGT

GTGTCTGCCCTTTATGAGGCCACCACTGTTCAAATGCTTGCCTGCAGCAT

TACTTGCCTAGGTAGTGCTTGTTTCTACTGAACTGTCAGGGATCCAATTC

TTTGTGGTCTAAGTAACAATACTCAGATTCACAAGGAATTGATTAATAAG

CCAGAATGCCAATGTATTACATTTTTGATGAAGACCATATTTACAGTGAT

TGTATCTGCTCAAGCTCAAATTAGGATTAGAGTTCTGACAAATACATATG

TGAGAAGTATGAGGTTAAATACTTGAAATTTGGACTTTTCTAGAAAATCT



FIG. A (25 of 61) 








WO 99/06426 



PCT/US98/16102 



GAATGTGATTGCCATTCACATACCTTTCTGGGGATGATGATTCTTGTACT 
TTTATTTTAAAAGACATAGAAAACTAACTTAAGAATCAGATTGCTTGGCT 
GGGCACAGTGGCTCATGCCTGTAATGCCAGCACTTTGGGAGGCCAAGGTG 
AGTGGATTGCTTGAGCTCAGGAGTTTGAGATCAGCCTGGGCAACATGGTG 
AAATCCCATCTCTACCAAAAATACAAAAAAAAAAAAAAAAACAACCAAAA 
AGAATAAATTAGCTAGGTGTGATGGTGCGTGCTTGTAGTTCCAGCTACTT 
GGGAGGATGAGGTGGAAGAATTGCTTGAGCCCAGGAGGTGGAGGTTTCAG 
TGAGCTGGGGTTGCAACAGTGTACTCCAGCCTGGGCGATAGAGTGAGACT
CCGTCTCAAAAAAAAAAAAATCAGATTGCTTTATTGCTGGTTTTCTTTCT 
AAAACTGAGATTGGGTCCCATCATCCCCTGGCCCCCATTGGTTAATGGTT 
CCTCCTTTGTCTATTGAATAAAATACAGATGTCTGCTTTTGGCAACATGG 
TTGAATGTAGACACTGCAGGGTCTTCCTGACTCAAAATGATTTAGGCTTA 
GATAAAACACATTTGGAAATGCATTTCTGGATTAACACCAAGGAAAGGAG 
ATCTCTTTAAATCCCCTTTCTGTTCCCCCCTCCCTACCCCCTCCAATTGG 
GCTTAAGTAAGAAGGGTGGTTACCCGCTAGTAAACCCCCTTCGAAGGGGG 
TCTTCTCCTCTAAGGGAAAACCCTTGTTTTGACATTTGCTTCSATGGGCC 
CTTGTATTTTGTTCCTTGCTAAACGGGTGCTAAACCAGGGGCCTCCTCTT 

>Contig46

AAGGCTTTTAGAATATTTGCACACTTTAGAAATGGAAATGTTTTTGGGGG 

GCGAGTTGTCTTAATATTTCATTTTTCTAGCTTGTGTGACATCCTTTTGA 

AAGCAGCAATTCTGGCCTTTGTGAGAGATGGTGAATGCCTGCAGGTGTGT 

GGACCAGTGCGTCCCTTCCTTCCTACATGCACGGCCCCCAGCTGGGCCCA 

GCAGAGTGCTGTTACAGAATAATTTCCAAGGGCTGTGTCTCTAACCTTTG 

GTCTTGTCCCCCATTGCTGTAGATTTGGCCAATTGACTTCATAAGTGCCT 

CTTATGAACATAGATGTTGGCAATGGAAGTTGAGGACCAGTCAGTGGTTG 

TTTTATTGAACACACAGCGTAAATCCCAACACAATGCTGACCTAAGAGAA 

TTCCAGCCACTCTGATTCTCAGTCTCTTTATATCTGAAAGGGTTCTGTTC 

CACTTTTTCCCAGATCAAAATGTCCCTGCAGCTACTCAGCAGAGCTGTCG 

CAACTTATACGTAGAAGAGGTAACAGTCCACAAACAGAAAGGCACAGGAC

GAGAGTGGTCTGGGTGATGCTTCCTGTGGGGGAAAAGGTGATGAGGGTGC 

ATCTGCACACCTATGTTCATAGGTAAGTCTGGGAGGAGGTGACCTCCCCT 

TTGGTTGAGGTGCTGAGGCGTCTTGTTAGAATGGCACTATTCCATTTATC 

TGATGCAGTCTGTGGGAATTTTGTGGTATGGCCACCACAGGTACCATGCT 

GGGAACAATGCCAGATACTGCCTGCTAAGCCACAGCATGAGTCACATGAG 

CATTTGTGGGCTTTGGGAACTAAAGTTATTGAACGATAGTTATCTGAAAA 

3GAATTTAGGGAAAGGGGACTTTAGTCCAGCGAACAGTTTGCAAACCAGG 

GGGAAGGCAGCCTTCAGCGTAAAATGAAGACGTGTGTGCCCCAAATAACA 

AAGGGAGAGTTTGTCTTTTAGAGAGTAAATGTCCACGCAAGGTTCCACTT 

AGGCAAATGAAAGATGCAAACTTGCTTAGTTCTGATTTGTTTACATTTGC 

TGAATTCGGATTGGTCCGTGCAGGCTTTTCTGGGAACTCCAAATACATGT 

ATGACCTCTAGTCATACATGGCAAATGGCCGCTTGGCTCTAATTTGAATT 

TAGGCCCAGTTAGTCACTCAGGATTAACCTTTTTCAGGGTTCACAGCTCT 

GAACAATGGACTTAGACCTGCAGGACATAATCTGTTCCTAACTCTGGGAC 

TACCTGTGCCTTTTGACTGTGCCCAGTGAGCAGCTGTGGCTCTGGGCCCA 

GACCCACAGGGCGATAAGGCACAGAGGTACGCATGGAGCAGGCTGTCCTT 

GCTGAGTGATCATGAAGATACACTTACATAGAGCAGCACTTTTCCTTCCA 

GTCTTTGTGATTTAACTCATTAGATCCTTATAACAAGAGTCAGTCCTCTA 

TTTAACCCATGAAGCACAGGTGGAGTCCAAGCTTAGTTTGTGAAGGATGA 

GCCAAAAGGATTCTTCTCTTGTAGACCTCAAGCTCAGCTCTCTCCATGGG 

CCCTGGAGTAGGTGAGAAGGCCTCTGTCTTCCAGAGCCCACTGCCAATCA 

TCTACATTTTCTGTTAGCCCAATTCTAGGACATTGCTTTACCAACTGAAG 

GGTGAGAACTATCATAAGTTATAAAAATCAATTGAAAAACAAAAAGGTAC 

AGAACAGAAAATAAAAGATGAGAATCTATTAAACATAGTGATGTTACTGG 

AAAAGGGGGTCTCAAACCAGACCCCAAGAGAGAGTCCTTGGATTTCACAC 

AGGAAAGAACTCAAGGTGAGTTGCAGGGTGCGGTGAATTGAGAGAGTTTA 

TTGAAAGCTATTCCATTACAAAGTAGAGCATCCTCAGACAGCAAGTGGAG 

GAACATGCCATCATTAAATTTTTCTTATATAGGAATCTTGTCTATATAAA 

GACTAAACTAAGCTGTGGCTATGTGTGGGTGGGCCGACAGCATGAAAACA 

TTTATTCTCCTATTGATTTAAAGAGAACTATCCTTGACATTTTAGTGTGT 
TTAAGTACATCAAAGCAl^CTATAATTA"fCTTGAAAGCATATATTTTTi*

TAGGGATTGGGACATCTGGGCTTTCTGTTGTTGTAGAAGTTTGTCCTTGC 

AGGGATTACCAAGCCACTTCCTTAGCTGTAAACATCTTAGGGCCATGGGT 

CCTGACTGGCAAGGAATGTGTCTTGCTAGTTTTAAGATGGGCTTGATTTG 

AAAATGGTGTCCATCTGGCTCTCCTAGGCTCCTGCTTTCCTAACAGTAAG 

GGTAAATGCTATGTTATGAAATGTCATTTCTGCCTTTAGCTTGCAAACTC

TTGATGGTGAAATTCTCCTGTCCGTTTTCAGTGGGGTATTTATTCTGCAT 

CCACGTCTTCACAAGGAGCTGAAAACAAATTGGATGGAAGCAACTGGGTT 

TTATGGGACACGTTAATGTTTTAATGTCATTTGGTGTGGAATTCAGATGT 

CCAAGCAACATTTTACACTACAAATCTGCAACTTTAATAATCACTCAAAG

TACCTGAACCTCAATGCTTTCAGACAGACTTGGTATAAAGCCACCACCTC 

TTTCTATTATGGCAGCCCTATCCTGAGGACACAAATTTCTGCAGGGCTTC 

TGGCATATCTCTGATTAAACAAATGTCAACAAGGTTAAAACAAATGTCAT 

CTCTGATTTGTTTGTTTTAAAGCCTGGATTTACTCATTGAATATTTCACT 

CCTACTAGCATGTCTTGTAGTAGTTTTCTTCAGGGACCCTAATTATTGCT 

ATTAAAAATATGTGTGCAGCTACATGTTTTTTTTTTATCAATTTGCAATG 

AAAACTTTAATTGAATAATCTATTAGTGTTATTATTTGAAAGTGAAATCT 

TTTCCTTTTGCTTTCTTGTTCTCACACATAGTGCAGACAGTTTCCACACG 

GGCTCATAAAAGGAATGATTCTGCCTTGTGTGAACTTTTTGCCTTTATTG 

TTAATTGCACCATTTTGTGACTGGCTTCTTGACCCTGTTGTAACCAAGCT 

CATAATGTACATTATTTCTTATTTTGCAGTTGTAGACZACTTGAGdXAGTT 

CCCATTCTTTGTTTCTTCTTGCTTTTGTTCCCTGTGATAACTTTTTCATG

CAGACATTTTTTTTTTTTTTTTTTTTGAGACCGAGTCTTGCTCTGTCATC 

CAGGCTGGAGTGCAGTGGCATGATCTTGGCTCACTGCAACCTCTGCCTCC 

CAGGTTCAAGAGATTCTCCTGCTTCAGCCTTTCTAGTAGCTAGGATTGCA 

GGCGTGCACTACCACACCCAGCTAAATTTTTCAAATTAGCCACCCCACCT 

GGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTTCAACCATGTTGGCCA 

GGCTGGTCTCGACCAGGTGATCCACCCGCCTTAGCCTCGCATAGTTGCAG 

GTGCTATTCTGAGCTCAGGGCTCTGGCAGCTACAAGCCCAAGATGCGGTC 

TCCAACATGTGGCCATTCAATGTCATGGCGCCCTCTACTGGTCCTGGGAA 

GCGCAGCTCTGCCAGTAGCTCCAGCAGGGCACAGCTGTTAAGTCGTGATG 

TTCTACAGGTGACCAAAGGGCAATCTCTGGACTCCTTAGCCGCTAGGTCC 

TCTCTGTAGCAGGACCCAGGAGAAGGCAGGGGCTGAGGATGGCTCTCTTA 

GACATTTGTGATGAACCAAACGTGTGCATTCATGAAACTTCTGTGAGCAA 

GCAGGTGAGTAGAGTTGGGTTATAAAAAGTCTTAGGGTCTCACTACAGAG 

ATGGACTTGCTGTGTAGATGGTGCAGAGCCGCTGAAGAGTTCTACTTGGG 

GTAATGGTGTGATTGGGTTTGCGTTTTAGGAAGATTTCTTGGCCAGAATG

AGGCGGGCAACCCAGAGCAGGGAGTGGCCACATGTGGGTGTGCAGTTATG

GGCCACTAATCCAGGTGATAAATGGTGTCTCTGAACTTCAGGTGGGGGTG

CCACATGTCTCCATCTGCTCTGTACCCTTGAGACTGGCCTTATGGGCTGC

CTTAGTGGTCTGTTGTCCTCTATCTCCTGGTTGGGCTCAGGCAATGGGAG

ATCAGAGGGAGGAAAGAGAGCTTGGTTAGAGTGCACCCGCGCCCCTTCAG

GTTGGCAGTGGCCACATTCCCCTATACAGAAGGCCACAGTTTCTGTCAGT

GGCCCTCCCACAGCCCCAGCTTTCTCAGTGGGCCAGCCACCTCCCCATCC

CTTGCTCCTCCTCCTCCAGAGAGGGTTGTGGATTTCCACTGTCAGCAGTG

CCTGGAGCTCCACCATCTCCTGCTGCTTCCTCTGGACCTGCCTGCAGTTT

TATAAATAACCTTTCCTTACATTACCTCTAGCATGCACCTTTTGTGTGTA

TACTCTGCCCCCTGTCAGCACATGACTCATGCCAAAGAGTTTGAATTTTT

TTCTCCAGGCAACGGGAGGTCATTGGAGGATTTTAGACATTGAGAACAGA

TGTGTATTGTGGAAATATCTGTCTGACTGAAGTGACCAGGATGGTCCAAA

AGAGCGAGAATTTGAGGCAAGCAAACCATCAGCAGGCCAGCAGCAGAAAT

CCAGGTCATAAACAGGGAAGCTGAGGCTCACAGGGTTGGATCAGGGAATG

GGAGAGGGAAGCCAAACAATTCCATGAGCATGTCAGTTGCACATATGACT

TGGTAACTATTTTTATTTTTATTTTTATGTTTTGAGACAGAGTCTCGCTC

TGTCACACAGGCCAGAGTGTAGTGGCATGATCACAGCTCTCTGCAACCTC

TGCCTCCTAGGTTCAAACAATTCTCCTGCCTCAACCTTCCAGGTAGCTGG

GACTACAGGTGCGCACCACTACACCCAACTAAGTTGTGTATTTTTAGTAG

AGATGAGCATTCACGCTGTTGCCTTAGACACGG

>Contig47

AATATTGATTATTTGACCAGAAATTCATGCAGCTAACCGTGACCCCTGGC
J^AAATAAAATAGTGTAT^.^GTACGTGCATATACATGCAAAGAAATGAGx .

GAAACTAGAAGGATGTCAATCAAATGATAACATGGTCATCTTGGGGTCGG 

AGTACATTTGGGGATGAGGGGAGCTGTAAAAGCAGACTTGGACCTTTTCT 

TCTACCAGTACCGTGTCATTTGAATTTTGGAAAGAAAAAAAAAAACTCAG 

AAGGAGGAGAAGGAGCAGGAGGAGAAGAAGATGGATCTTAAGTGATTTGC 

CCGGGAGCACCTTGAGAAGGTGAGATTCAAGTCTAGGTCTAAGCTTTCTA 

ATTCCATGAGTGGGAGTGACCCACGTCCAAGAGGAAGCTCAAAAGGAAGA 

TGTTCTCCATCATCTCTTGCTCATCCTAACAGCATGCAAAACCACATCCA 

ATGCAGCTCAGAAAACTCCCAAATTGCCAAATTTCATTGGAAACACTTAA 

TGCTGTGGTTTCCAATTTCAACTGTAAAGTAGGTATGTATGCCATTGTTA 

CCATTAACTTCTCAGAAATGGAGAGAGCTCTCTTTCCGCCTCCTCCCCCT 

CTGCTGTGGCTTTGGTGAGACGTGCACTCAGGCTCACCTGTCTCCATGAT 

CTCCAGTAAGTACACATGAGCAGAGAGGCCTCAGCTCAGCTCTTCCTGGT 

CCCACCAGGGTTGATTCTTTGAGAATTCTAGAATGCCACATCCTAGGCCC 

CCCAAAGAAATCCTGCATCTTACCCCCAGAAATATGAATCATAGCAAATT 

TCAAATCAACCATCGTTTAATACTCACAGACTGGGCACATCC^AAAACAT 

ATTTTCAGTTTTACAACAGTGCCTGGTGCATATCGGCACTATTTGTGGAA 

GCAATAAATCGACACGGAGCTGAAACACAAACAAATGCCAAATTGTTTTT 

ATAACACCTGATTTTCTTTCTGTTTCTTTATGCAGTTTAGTTTTGTTTTG 

CTTAACTCTACCTCAGACCATAGTCTGGTAAACTCACCACCCAGAAGCTC 

CCTTGAAATGTGGGTATGGAGCCACTAGGTGGCAGGAGAGAGTTTCCTGC 

CTGGAGGGAGGACAGCCACTCTGTCCCCGGGTCAGGCCAGGGCCACCCTG 

CTACCTGCAAAATTAGCATGGGGCTTTATGAACCACAGCTTCCTAATAAA 

CACAGGATCTGTTTGATAGAGACTCCAAAACACGCCTACCTAGTGATGAA 

AGACTCAACTTCAGAAGAAAACCTTCATGGCAAACATCTTCAGAGATGTT 

TCCAACTTAAGGTTCTGAACACAGACGCTTCCCCAGAAAGCCATTGTTTC 

TCAGCACCTGGGAGCCTTGCTTTGCTTTGCTTACAGACTCGCTGTTCTTA 

AATCACTGCCAAGATAACATCTGTCTCTTCTCTTACCCTCTATTTCGATA 

TAAGGACTCCTCACTCTTGTTGCTTCCTATTGGCTACCTCTCCACAGGGA 

GAAATCGCTGATTTAACAGCAGTCAATATCCCAAATCTGGAACAGGGAAC 

AGGGAAGCATTTAAAAATTGGAGAATTTAGGCCGGGCACAGTGGCTCATG 

CCTGTAATCTCAGCACTTTGGGAGGTCGACGTGGATGGATCACTTAGGAG 

TTCGAGACCAAGCCTGGGCAACATGGCGAAACCTCATCTCTACAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAACCCAAAAATTAGCCGGGCATGGTA 

GTGCACACCTGTGAGCCCCAGCTACTCAGGAGGCTGAGGTGGCAAGACTG 

CTTGAGCCCTGAGGTCGAGGCTGCAGTGAGCCGAGATCACACCACTGCAC 

TTCAGCCTGGGCAACAGAGTGAGACCTTGTCCCAGATAAATAAATTAAAT 

TAATTTAATTAGAGGATTTAAGGATTTTCCCTACAGACACCTCCTTATTT 

TCTCTGGCCTTTTCTGACTACTCTCCCTAACTCCCTGCTCCTCTGGTCTC 

CCAAAACTACTCCAGAAAAAAAAAAGGGGGGGAGGGACTAAAGGAAAGCC 

AGGTGACAGTGCCAGTGTGACAGATGACAAAGCATCTGCCCGAACAAACC 

GTAGGTCCCTGAACTTTCTCCAAGACCTGTCTGTGGACTTACCTATGAAA 

ACCAGTTTTAGCAAAAACCCTCCTAAGCCAGTTTATCAAGATCCCCTTAT 

CCTCAATATCCATCTGATTGGATTCTTCATCCCCCACCATTCCCCAGTGA 

TGTCACCAGGCCTTTCTTCAGCAACAGTAGTTAGTGGGTGTAGCCAGGAC 

GCCCCCTCACCCCTGATATGCCCTTTTAGTAATTCTTCATCCACAGGTTC 

CCACCCTGCTCCTAGGCTATACATTCCCATTTGCCCATGCTGCATTCGGA 

ATTGAGCCCAGTTCTATACTGAGGTCTTACTTCACCTCTCGCCATAGTCC 

TGAATAAAATTGGTTTTCACATTTAAAAACTGTCCAGCTCTGGTTGTTCC 

TTGACACAGGGTAATTTTTATTCCATGTGATAGTTTGCCTTACCTCAGCC 

TACACCCCTCAAACCTGCAACTCTATATTCAAGAACCAGACAGCCCTTTC 

CAACAGATAGGAAGAGGCTGCCCTGGTGCAAAGGAAGAGGCTCTGGGAGG 

AAGGAGAGAACCCGAAGGCTGCCCCCTCCTCTAGACTGAGCTCTGGGATG 

GGTGGACGATAAAACCCAGATACGTTTAGACATCTGAGCGTGGAGAGGAC 

TTTGCTTTGCTTCCACAGGGACCCCAAGGAAACTGCAAGCCCTCCAGAGA 

CTAAAAACAGCAGAACAGCAAGAAATGGCAGCAAAGGTCTGGGCAGAATC 

ATCCTATGTGGGCACAGACACAAACAGAGTCCCCTGTGGCCCCAGGAGAG 

TTTAAAGAAGATCCAGAGGCTGTCCTATTCCATATCTCAGCAGAGACAGG 

CCCGTGAGCCTAAAAGCTGATCATTAGGACAAGAAGGACACGAACTGTCC 

TGCAGCGTGAACCGCGTGGAACAAGGCCAATCACCAGACACCAGACCAGC
CAGACACAGCCCCGCAGTXCCCCAAGACCACCACGGACCCATCGCCCCTC

ACCAATAGCTCCAGGCTACATAGACCCCCTCCACTTCATGGATGTCCTCA 

GAGCAGAAAGGGGAGGCAGGAGTGGAACCCTGACTTGGTTCAGTTGAAAC 

ATAAAATGACTGTACTATTATTGAATTGCTGAAGTTTACGTGAAAGAAAT 

GAGATTTAGTTTTTGGCCACAGTGCAAAATAAGAAACGAGGCTTCAACTG 

AGATTAAGGTGAGTTATAGGAAAATGTACTCCCTTGAAGGACCTGTGAAG 

TGXGTTCGCTATGAGAAAATGACCAGAATCCACGTTCTTAGCTGCGGGAC 

TCAGGCTGACTCCTGTTTCTGGAGCTTGCACAAAGGGCAGGGAAATCCCT 

GTTTCAGGCACAGTGATTTCAATGTTTAAAAGAAAACAGGTGGGCCCTGG 

CAATCATGATAACATGTCATAAGTTTACATCTCTGTGAGGCAGGTAGTGT 

AATCCCCATTTTGCAAAGGAGGAAACCGAGGCTGAAAGCAGCTACATGGT 

CTCTTCAATGTGGCCCAAATGTTGGAGAACAGAGCTTAACTGAATCAGCA 

ATTCTATACTTAGAACTGACTCTCTCTTTATTATATCTCACTACTACCTT 

GATATTTGAAATATTCAACTTTTTTCAATCAAAAAATAACAATAATTTAG 

GCATAATGACTACTATGTCATTTAATTTCTTGCTGATATTTCAATATCCC 

ATGCCAGGAATATTGAAAGCTCAGCTCCTTAAGAGCTGACTATGGCATCA 

ACTCCCAACAACCATCCTTCCAGAAATATTTTCCCCTTTCTTTTGTTATA 

GAGTGGCACTGCCCTATATGGTGACCACTTGCCACATGTGGCTGTTGAAC 

ACTTGAAATTGGCTTGTCAGAATTGCAGTGTAAAGTGTAAAACACATACC 

AAATTTCAAAGACATGGCACATAATAAAAAATGTAAAATATCTCAXTAAC 

AATTTITATATTGACTGTGTi^GTAACATTTTGAATATATTGGAT 

ACATGGATGATGCCCCAACACCCACAGTCCCTTATCAAGTCTCTACTTCA 

CATTTTTGTACTTCTGACTTAGAAATAGCACTGGCGTCTAAGAGCCTATT 

AATGTCGTCAATAGGTTCTTGGGAACCACAATTTTAAACAAAATGACATA 

TAAGAAAACGAATAACATTGAACAAAATGACATTATTCGAGGACCTGCTG 

CATGTTGTTTCACTTAAAGTCAGTGTCCAAGAACCTATCAGTGACATTTA 

GTGAGGACTTGCTGTCCTTCCTGTTTACAGGAACCTGGGCAAGTTACTTA 

ATTCCTCTAAGCCTGGTTTATATCCCTGCAAAGAGAGAAGGATAATAATC 

ACCAGTACTTAGTGATGTCGTAAGGAGAAAATAAAATAATAAATATGAAA 

TGGCTGACAGTGTCCTTGTCACACAGAAGATGTGTGATCCACAGTAGCTG 

CTATTGTCTGCCTCACTTCACTAGTAATGGTCCAGGGAGGCCTTTAATGT 

GCATGGTGCAGTACATTCACATGTTGGACATGGGTGAAGGGAAAGACCAG 

GCTCATCTAAACACAATAGGATGCTTGTGGTGTTTTGAGGAGGAATCAAG 

GACTAGTTATCCACAGCTGTAACATGCATGGATCAAAAGAGATAAGGCAC 

ACAAAAGACTTTGTCAGTAGCAAAGCATTACAAAATGCAGAGACCAGCTG

TGGGTGGTGGTGAGTCAGACCCAGCTTCCCTCTGTGCCTGGCTGAGTGGT 

TCTGGGCAAGTCACGCCATCTGTCTTGATGCCCTTCCCCATCTATAGAGA 

GGGAGCAACTGAGGCCCCTTCCAATACTGAAGTCCTTTATTTCTGCTACT 

TTAGAAATATCCACATTTTTGGTAAATTCAAATGATCCAATGATTCCATT 

TCCTAATGTTCAAAACTAGCCCCAGAAACATCTAAATGAATCAAACAAAT 

AAAATATTTATTGTGTATGTTTTGATTGCTGAAACTTCTATTTTAGCAAC 

ACACACACACACACACAGAACCCATAAGCCTTCATCTTTCCTTGGATAAA 

CGAGCCTTCCTGTCTGGCCATTTAAGTCACGATTAAGTAAATGATTTCCA 

ACTCGCCTTTTGCAGCAGTTCAGATGGGTCTTTCCTGCGTGGCAGTGGCC 

CTCCTGACTTATGATTTCCTGTGTGTCGGCeTGTTACCACTGCAGCTTAA 

CTGAGGAAACAAGAACAAAACAGCTTCTGACCCCAAGAGACTGTTGGAGG 

CAAAGGCTTCAGTCCCAAGAACCTCACACGTGGGGAGCCCGAGAGCCCAG 

CCCTGACCTTTTCTCCAGTAATAACATAAGAAACAACAGGCACTGGCCTT 

ATTTTGGATACAAAGAGTGGTGCTTTTCCTTAAATCTTCCTTTAGTCAGG 

GCTACCCCTTCATGGACGCCCCAACATCCATGGTTCCTGCTTGAGTCCCT 

GCTTCCATATTCCTGCACTTCTCACTTGAAATATCCCTGGAGTACGTTAA 

GCAGCCAGGTTTGGAAGTTCTTGCTGTGCAGGCGGGTGTGTGCATGTCCT 

CTCTCTCAACAGGACACAAGCTCCCCAAATCAGACGGTATGCCTCCACGC 

CCCTTCCCAAGCCTCCCCAGCAGCACCGAGCATGTGAGGGGAGCTGGGGC 

CCAGGCCATGATGGGAAGCACTCTCTGCCTAAAGACTAGGGTGATGCGCC 

CTCAACTGTGGGAATGAGCCCCAGCTCTGGTGTCTGCCTCGGTTTTTCCT 

CCTGGACAATCAACATGAACTCCTCACCCCTCTTATCCACTTTGCATAAA 

CTGAAAATAACAAACCCAGGGCTCTTTCTGTCACAGGAAAGGGTTTTTTT 

TTATAAAATTAAACAGAGATGATTCAACACACCCAGGATATAACACATGG

GCCATGAATCAAGGGCAGCATTGCTCTGGTCAGCCTGTTGTTTGGGCCCC

CTTGGCAGGGCTCTCCCC rGAATCTTCCCCTCTTGACTCCCATCANCACA

GCACTCCANCTTTGTGTTACAGGCGATAAATGGGAAAGGGGTAAAT 

>Contig48 

CATTCTTAATTAGAGAAACGCTCATTAAACTAGACACCCAAATTCTCTGG 

GGGGGGATCATTCTTACAAGCATGCCCTTCTCTCTTAAAGAGAGAGCACT 

TTTTTCGCAAATAATGCTGCCATGAACATACGGGGTGCATGTATCTTCGT 

AAJAGAATGATTTCTATTTTGGGGGGTATGTACCCAGCAATAGGATTGCT 

GGGTCAAATGGTATTTCTGGTTCTAGATCTTCGAGATCTTCCACACCGTC 

TTCCACAATGGTTGAACTAATTCACATTCCTACCAACAGTGTGAAAGCAT 

TCCTATTTCTCTGCAACCTCGCCAGCACCTGTTATTTCTTGACTTTTTAA 

TAATCGTCATTCTGACTAGCATGAGAGACAGTATCTCGTTGAGGATTTGA 

TGTGCATTTTGCTAATGATCAGTGATGTTGAGCTTTTTTTCATATGTTTT 

TTGGCTGCAAGAATGTCTTCTTTTGAGAAGTGTCTGTTCATGTCCTTTGC 

CCACTTTTTAATGGGGGTTTGTTTTTTCTTGTAAATTTGTTTAAGCTCCT 

TATAGACTCACAATAACAAAGACATGGGATCAACCTAAATGTCCATCAAT 

GATATAACGGATAAAGAAAATGTGGTACATATATACCATGGAATAGTATG 

CAGCCATAAAAAAGAATGGGATCATATCCTTTGAAAGGACATGGATGAGC 

TGGAAACCATGATCCTCAGCAAACTATGCAAGAACAGAAAACAATTGTTG 

CATGCTCTCACTTATAAGTGGGAGCTGAACACTGAGAACACAGGGACACA 

GAGAGGGGAACAACACACATTTGGGGCCTGTCAGGGGTGAGGTGGGGGAG

GGAGAGCATTAlSGAAAAATAGCTAATGCATGCTGGGCTTAATACCTAGGT 

GATGGGTTGACAGGTGCAGCAAATCACTGTGGCACACATTTACCTATGTA 

ACAAACCTGCACATCCTGCACACGTACCCCAGGACTTCAAAATAAAGAGA 

GACAATACTTCTCCCTTAAGTGTCTACTGTTGCTTTGCAATAAAAAGTTC 

CTGCCTTTCACTTCACTCTGACTTGTCCCTGAATTCTTTCTCGTGATGGT 

GTCAAGAACGTGGACACTGGCTGGGGCTGGAGACTCACCAGCATCCGGAG 

ACCCTCCTGAGCCCTCCAGCAATACAACTTTGACACAAACTATGAAATCA 

CAGATCCAAGAAGCTCAAAGAACCCAAGCACAGGAAACATGATGAAACTA 

CATGAAGGAACATCAGAATTGAATTGTTCAAAATCAGTGATAAAGAGTAA 

ATCTTAAAAGCAACCAGAACAAAATATCCATCATATACGCAGAAATAAAG

ATAAGTATGACAGCAGATTTACAAATAGAAAAAAAAACAAGTGCAGCAAC 

AGAAACAAACTATCAATCCATAATTCTATACCTAGTGAAAATTTCTTTCA 

AAACAAAGGTGAAATAAAAAAATTATTTTCAGGAATACAAAAGCGAAAAA 

ATTAATCACTAGCATTCATCACTGCAAGAAATGTTAAAGGAAGTCCTTTA 

GGCAGAAAGAAAATGATACAAGGTGAATATTTGGATCCCTGCAAGGAACT 

AAAAAGATCCAGAACTGATAACTTAATGGGTAAACATGTAATTTTCATCA 

ACAAGTGAATGAATAAACAAATCATGATATATCCATATGATAGACTACTA 

CTTAGAATACAAAAGAAGAACTACTTATGCATGTGATAACATGAATGATA 

TTCAAAATTATTATTGAGTGAAAGACACCAGATCAAAACAAAGTACATAC 

TGTATGATTCTGTTTATATAAAACTCTATAAATTGCATGCTCTTCTATAG 

TGACAGAAAGAAGATCAGTGGCTGCCTGCAGACAGGAAGAGATTACAAAC 

GGAAATGAGAATTCCTTAAGAGATGATGGACATGCTCATTACCCATCATA 

TGTATACAGCCATAATGGTTTTACAGATACATATATATGTACACGCCAAC 

ATAAATATAAGTTATCAAATTACAGTAAGTTCTGACTTAATGTCACTAGG 

TTCCTGGAAACTTTGACTTTAAGCAAAATGATGTACAGTGAAACCAATTT 

TACCATAGGCTAATTGATATAAAGATGAGTTAGGTTTTTGGTTTTTTTTT

TTTTGACATGAAGTCTCGCTCTATCGCCCAGGCAGGAGAAGAAGAGTTAG 

GTTTTACAGCATGTTTCTGGTCACAAGAACATCATCAAACTTGTAAATAA 

AGGCACAAAACACTTCTAATATTAAATATCAAAATAAATATGAGTTATAC 

AGAATTTAAGAAAGATTAATAAAAACAAGTAAAATCATTATTTATGGGAT 

TTTTGGTAATCAGTGAGTTATGTGGTCATAGTGGAAGTGGGTTAAGTCAA 

GAAATAAATGTTTGCAAAACAAAAATTTTAAAGATCCTCTCCTACCACCA 

CACAAAAAACAAGAAAACACGGTGGGCTCGCTAAGCACTTTTGTACCACT 

CGTATCTTATGCGTTTGTATGATTATTGTAAATGCTTTATGATAATTTTT 

AGAGACAGGGTCTCACTCTGTGTCTCAGGCTGGAGTGAAGTGGTGCAATC 

ATAGCTCACTGCAGTCTCAACCTCCCGGATTCAAGAGATCCTCCCACCTC 

AGCCTCCAGTGTAGCTAGGACTACAGTTGTGTGCCACCATGCCCATCTAT 

CTTCTTTTTTATTTTTTGTAGAGACAGGGGTTGTGCTTTGTTGCCCAGGC 

TAGTCTTCAACTCCTGGGCTCAAGCAATCCTCCTGCCTCAGCCTCCCAAA 

ATGCTGGGATTTCGGACATGAGCCAGCAGCACCTTGCCCAGCATTTTATT

TCATAATAATTATAAGTCZATTCCTTCATTCATCTTACAAC CCACTTGTTC

CAGTTCAGGATCTCGGGTGACCAGAACCTATTAACGTTCACGCACAAGTC 

AGAAACCAGCCCTGGACAGGACACCATCCTACCGCAGGGAGAACTTACAC 

ACCCACACTCACTCAGACTGGGACCATGCAAAGAACCTAACGTGCACTTT 

GGAATGTGTGTTCCATACCCACTAGAACAGCTAAAATTTAAAAGACTGAC 

CATACTTGAGTGTTGAACAGGATGTGACACAACTAAATCTTTTAAGCGCT 

TC^GCGTAAATGGCACAGCCGCTTTGGAAAACAGTTGGCAGTTTTTCAAG 

TTAAATATACCCAAACTCTATGATCCACTTCTCAACAATCAAACAAGAGA 

AATAAAAGCAATGTCTACACAAAGATGTATACACAAATGTTCATTGCAGC 

CTTAATTATACTAGCCCCAAGTTGAAACAAGCCAAATGTCCATTACCAGA 

TGACTGGAACATACAAATTGTGGTATATTGATACAATGAAATACTACTTA 

GTAATAAAAAAGAAAGAGCTATTAACATAAGCAACAACATGGATGAATCT 

GAAAACAATTATGCTAAGTGAAAACAGCCACACAAAAGTTACATACTGTA 

TGATCACATCTACATAAAATTACAGAAAAGGCAAACTAATCTATAGACAG 

AAAAGCAGATGAGTGGTTACCTAGGGATGGGGCAGAAGGGACGAAAGGAT 

GGATTGCAAAATAGCACAAAAATATTGGAGGGATGACAAATATATTCATT 

ATCTTGATTGTGGGGATAGTTTAATGGGTATATATAGAGATCAAAGCTCA 

TCTAATTATACACTTTAAATATATGTATTTCATTGTGCATCAGTTATTCA 

TCAACAAGACTATAAAATAATATATGCCTACATACATTTTTAAATATTCA 

AAATCTCACAGTTATATACATAAATGCAACTGAATATGTATTCAGATGTT

TTAACAAGCAGAAAGGACTGATTAAACTCATGACAGCGGCTGTTTCTGGG 

AAGGGTGTAGGAGACAAGAGATGGAAAAGAGGATGAGAGCCAGAAGAGAC 

CCTTGTAATGTTTCCTTTCTTTTAGTAAAAATATATTGACAGTTAAAGCT 

GAGAGGTGAGAATAATAGTCTCATGGCTTTTGTGTCCTTAAAATTTCACA 

AACTAAGTGAAATGGGAGAAAGCAAAAAAATAAACTTAAATAAATGTTAT 

ATTGCCCAAAAAGAGATTTAAAATGGAGGTTAGACACATGAGACTTACGT 

TCTCAAAAAAGTAGAATCTGCAGGGAAGTTTAACAACTATAAAGAATTAA 

AATCTAGCTTCTACCAGCCCAAAGCCTAAAATGTTCTGCTTTATTCTTCC 

TTATTATAATTCATAGGTAATATATTTTATGTTTGCAAATGAATGCAGTG 

ATATTAGATCTCTAAGAGGTGCTAAAAATGAAAAGTAGATATTCCAATTT 

TTCCCAATTTTCCTTCTCTTTCCATGAATGAAAAATATACATATTTGATG 

ATTTCCAAGTTTATACAACCGATCTTTCTCTTAGTTTTCTCTTACCAAAT 

TCCCTCCCTCACTCAGCCACCAGCCAGTCCAACTGTGCTACCTGCACAGC 

AGCCCTCATACCATCCACACTCTCATCAGGATCCTGCCTGACCTGCGAGG 

AGCAGCAGCAAGAAGGAGACAGAACCTCCACGCTGAGCATCTCAGGGCTT 

TCTCAGAGACTCCAGAGGACCCTGATAGGGACAGAGCCTGGCCAGCAATC 

CATGCTGCCAGCTGTATGATTGTGGGCATGTAAATTCTCAACTGAAAATG 

GGTGTAATAATAACATGTTCTTCCCAGAATGAGCTTTATGAAGATCATAT 

AGCTGTTTGGAACTCAGACAAGCACTGGTAGGAATACAAACAGGGGAGCC 

AACAGCCTATAAATAATACTTTAAGAAAGGGCATGAATGTAATTACTTAG 

GAACAAAAGGCAAAGTGGAGAGATGCCTAGGACTGAGCTGGACAAGCTGC 

ACCCTTTAGTGGCTCAGCCCATGGGCTGACAAGGAAAATGGAGGAGCTAC 

CAAAGAAGGTGGAAGGATTCTGGGAGAGTGGCCCTCACCCTGCCCAGGGC 

AGGGCTCAGTGGGAGAGAGGGAGATCTGTTATAAATGCTGCCAGGAGGTC 

GAGTCATGTGAGAATGTCCATGTGAAAACATCCACTGTGTGTATCTAAAG 

AGAGTGGCTGTAAAACAGGTCAGGGTCAAAGGTCTTATTGTCTCAGATGT 

TATCTGCATGCATTGTCTCACGACCAAGAAAACTAAGGAGCATGGACACA 

AAGGGTTAGGTTGAAGCAAAAATTTAATAAGTGAAAGAAGAAGGCTCTCT 

GCAGTGGAGAGGGGAGTCTGAGTGGGTTGCCACTTTGACAGCTGAATCCA 

AAAGCTTTTATAAGAAACTCTTCTCATATCTGCAGCTGTTTGAGTAACTT 

CTCTTACCTATAAAACTGTCTGTATAACTCTCCCTTATCTATGCAGCTGT 

GGGATGTCTCCAGGTAAGCATAAAGTGTAGCTTCTCTTGTTTGTATAACT 

GTGGGTTTGTTTTAGGCAAGCCCCCATCCCCTCCCTGTGTAAGCTCCCAT 

GGAGCCCACCATGTGCATATCTGAGAAGTGGAGGAAGCTTTCTCTGGGAG 

CTCACTGATCGTACAAAGAACAAGAGGCTTCTGTGCCGCTTATCTATTCA 

GGTGCAGCCTGAGTTTTCCCCAGGCTGCTCTATTTTTGCCTGTAGCTATG 

ATTTTTCAGGCAGGCTGCTTCTCTGAAGACTAGCCTTAACTGTCTACCTA 

TCAGATTTTTCCTTTTCTTCTCCCTCAGCTGGTTCCCCTCACCAAGGCTG 

AGCAAGTGAAAAGGAGGGCACAGGGCAGGCCAGTAGTGAGCAGCAACAAG 

GAACTAAGACAGCAGAAACCACTCTTCACACCTGGGTTGAAAGGGGTGGG

GAGCCAGGACTACAGC1 UVGGTAAGAACATAGGTAAAGAGATACTGTTGT

TGTGTTGTTTTTAACTATGAGAAGCATTGAGCTTTAAATTTCTACAGGAA 

GGATCCAGTTGAGACAGGAGCACCCAATATTCAGAAGAGAAGAACATGGT

GTAAAGGTCCTGGGAAGGCTGAGAGGATTGGGACTCAGAATCCAGAGCAG 

AAGCCGTCTGTGAACAGAAGAAGGACCTCCCCCAGTGTAGCAAGAGGGAG

GGAGGAGGGACAGATGCCAAGATGGTTCAGGAAGAAGGTTTGGTGGTAAA 

TGJGAGGCTGTGCTCACCTGCTGGCTTCAATTTTCTCTTTAAAATGTCAG 

ATGGAATCATTTGATGAAGGCCATGCCATGCAATGAAATGGCAGTCTGAG 

GCATGGAGCAGCTCCAGCTTAGCCCGTGTTTAGGGTAATTATGGCTCCAA 

CCCAGGAGATGAATATGACTAGGGAAAGTGAAGTCCAAAAACAAATGGTC 

TCAAGTTGACTGTGAGTCTTCTGGGAGGCTGAGACGACAGGTGGGGTTGA 

CAAGGGAAGGGGAACCCACCTGCTGAAAAACATCAGGCTGTTGGCTGGGG 

GAGGGGTGAGGCCTGTGTTGTAGAGATGGATGGATGCCTAAAGTTGGGTA 

AAGGTTTCAACTCTACCCTCTGCTGGGTGTGGAAATAAACAAAGACCACC 

CAAATGAGAACAAACAAAGACTATTTATCCAGAGCTTGCTCTGACAAGGG 

AGTCGGCAACCATCACTTGCTTGGCAGAGACTCAGAAGTAAGCAGGGGAG 

AAAGCCTCATAGCAGAAAGAAGGGAAGTCTTCATGTATGCCCTGAGTGGC 

AGCTGTAGATGTGGGTGAGTTGCAGGTGGCTAACTAGAAATGGGGGACTC 

CTGTGTGATTGATTAGGAGCATGTTTGGCTTTCTCTGGTTGGTCCTACAT 

TGGAAGAGGGAACAAAAAATTTAGGGCAGTTGTCAGTTATTAATCAAGTG 

TTGGCCATTTTTGACTGACTGTTACAGGAGTGACTGGCTCCCTGGATTGT 

TTGCTAGAAATAGTGGTCTTCACTTCCTGCAAGTCTGACTTTCTGGTAAT 

AGGCTTCCTGGGTTGGCTATTGTGGATAATAAGTGGGTTTCCTGAGCTGA 

TTTCTGCAGATTGTGGATCAGAGTTATTTTATATAAACAGTCTGACCATT 

TTCCACTGGCATATTCCATCTTCCAAGAGCTGGCCAAGCTGCTGTCTTAT 

CTGTCTCCCCCAGCCCCTCCACTCTGGCTGTGAAAATACAAGCCACTAGG 

TGAGGAATGGGGACAATTGAAGACTGAAAGCTTTTCTTTGCTGGGTTCGC 

AGAGCTGAGGAAAGAAATGACAACATCCAAGTGTCTGCCCTGGGCCAGTT 

TTAGGACTGTAGTGGTAATGCAAGGACTGTGTGAGTTTATATTTTCATTT 

GTCTCTCTAACTAAGGTGGAAAAAAAAAAACAGAAAATTGTCTGTCTGCA 

GTCTCTGCAAAAGTCTAACACTGTGCTTCCCAACATTGCAGCCATTAGCC 

ACAGGTGAGTATCAAGCACTTTAAATGAGACTGGTCCAAACTGAGATGTG 

CTCTGAGAATAAAACACACAGCAGATTTCAAAGACCTAGTACATGCCCTG 

ATTTCAAGCTATATTACAAAGCTGTGGTAATCAAAACAGTATGGCATTGG 

GAAAAAAATAGACACATTGGTCAATGTGACAGAATAGAGAGCCCAGAAAT 

AAACCCGTGCATGTATAGTCAACTAATCTTTGACAAGAGTACCAAGAATA 

CACAATGGGGAAAGTCTCTTCAATAAGTGGTGTTGGGAAAACTAGATATC 

CACATGCAAAAGAAAGAAATTAGACCCTTGTATTACACAAAATCTAAAAT 

TAATTCAAAATAGAAAAAGACTTACATGTAAGATCTAAAACCATAAAACT

CCTAGAAGAAAACATAGGGAAAGAGCTCCTTGACACTGGCATTAGCAGTA 

ATTTTTCAGATATAACATCAAAAGTACAGGCAATGAAAGCAAAAACAAGT 

GAGAGTATATCAAACTAAAAAGTTTCTGCACAGCATAAACAATCAACAGA 

GTAAAGACATGACGTATGGAATGAGAGAAAATATTGACATCTGACAAAGG 

GTTAATATCCAAAATATATAAGTAATTCACACAACTCAGTAACAAAAGCC 

AAATAACCTGACTTTTTTTTAAAATGGGCAAAGTACCTGAATAGGTATTC 

CTCAAAAGAAGACATACAAATGGCCAAGAGATGTATGAAAAGCTGCTTAA 

CATAACTAATCATCAGAGAAATACACAAATCAAAACAAGATATCATCTCA 

CACCTGTTAGAATGGCTATTATTAAAAAATGAGATAAGTGTTGGCCAGGT 

GTGGAGGAAAGGAAACCCTTGTACATTATTCATAGGAATGTAAATTAGTA 

CAGCCATTATGGAGAACAGTATGGAGATTCCCTAACAAAATTAAAAATAG 

AATTACCATATGACCCAGCAATTCCACTTCAAGGAATACATTCAAATACT 

ATCAGTATCTCAATAAGATACTTGCACTCCTATGTTCGTTGCAGCGTTAT 

TCACCATAGCCAAGATACAGAAACAAGTTAAATGTCCATCAACAGATAAA 

TGGATAAAGAAAATCAGGTACATATATATATACAATGGAATATTATTCAG 

CAAAATCCTGACATCTGAGATAACCTGGATAAACCTGGAGGACATTATGC 

TAAGTAAAATCAAAGCCTGACACAGAAAGACAAATACCACATAATCTCAC 

TTACATATGAAATATGAAAATGTTAATTTTATGGAAACAGAGTAGAATGG 

TAGTTGCCAGAGCCTGAGAGTAGAGAAAATGAGATGCTTGTCAAATCAAA 

TCATCACATTGAATATATATAATCTATTTGTCAATTAAATATTTTAAGAA 

TAAAAAATACCTGGCACCAAAAAAAGAATGCAAAATGTCTCAACAATGTT

ATATGTATTGCATTTTG^AGTGATAATAATTTGAATATTAGGTTAAATAA

AATATATTTGAAAAATTAACTTCACCTATTTCTTTCCATTTTTGTTAACA 

TAGGTACAAAAAAAAATTAAAATTACCTATGTGGCTCATGTAGGTGGCTC 

ACATTATACTTTGATGACACTATACAGGCTGGTGACCATATATCTCTTAG 

ACTAGTCTAAGTGATTTAACAGTGGTTCCAGAAAGATCCAGGTTTAACAC 

CAATGAAAGGGCCAGCTGGCTTAGCCCAGCTTGTGTGGGAAATGTTGGGG 

AGXGGTT7AAGACAGGGAAAAGCAAAACTTTTGATGCTATTGACTTTTTG 

AAAAATCTTTTGTGGCTGAAAAAACCAAAACATTATT 

>Contig49

GCTCGAGTGTGTCTCTAAAGCCTTTCCCCCATTGGCTCCACTATACGCAC 

TCTCCTGGTTTCCTCCCCTCTAGCCGCTGTCTTTGGTCTCCTTTCTGATT 

TTGCTGCGTCCTCTGTCCCCTGAATGATTGCTTCTCCACTACGGGGTGAT 

TTTGCTCCCCAGGGGACATTTGGCAATATCTGGAGAGGTCTATGGTTGTG 

TTTGAGGGTGTTGCTACTGCCATCTAGTGGGGAGAGGCTAAAGATGCTGT 

TAATGCCCAGGACAGTCCCCATAACACAGAATTATTCAGCTCAAAATATC 

CATGGTGCCAAGATCAAGAAACCCTGCTCAAATATTAGCATGTGCTGAAG 

GCCCTTCTCTTTCCTTTAGCAATATCTGCCTCCTTAGGGATCTTTTCTAG 

TCTCAGTGGTTTAACATTTAAAATCCCAAATTAGGCAATAAATTGGGCCC 

CAAACTTCGTTAGTATAAAATGTAGAACTGTGTTATTAGAAGGCTAATAA 

AATGACCTGGTGAGCATCTGCAGCTAGCCTCTGAGCAATTCTGGGGACCA 

CGTGCAAGATAAATCCZATCTGTTCCCTCTCTGTAATGTGGCGCTACCTTG 

TGGCCGATTTTTCCTCGGGTTAAATATCTCTGGGGATGCAACTTGTCGTG 

GTTAATGGCTGTGTGAGGCCAGCGCGTGGTGATAAAGGAATCAATCAAGA 

CAATATTGAATTTAGAAAGGCAGATTTATTTAGAGAAAAGGAGAGATACG 

TTGCAAGGGAGCAATGGGCAATACAGCAGAGGGAAGGCTGTCTGCAAAGA 

GGCAAGGGCTACGTATGACGTAGGGCTGCTTAGGCTGAATGCTTGCAGAC 

AAGATGCTTGCGTGCAGGTGGGCTGTGAGCTGAGTGCTTGGGTGCTAGTG 

AGCCATTGGCAGCTGACCCTATTTCTTGGAACATTCGCTCCCTGCAAGCA 

TTTTAATGTTAAACCGCCAGGTCAGTTTGAATTTTCTTTTTTCTTTTTTT 

TTTTTTTTTTTTGCCTTTAGTAGGACCTGCCGTTGTGAGACTATCTGAGG 

TAAATTAGACACCCTCCTGGTTTAAGTCACCGCTCCAGTGACTAGGCAGG 

GAGCTCTTCCTTGAAGAGGGTGTGGGCAGTGGGTACTTTGCATGTTGTCC 

ACACCAGGCGAGCTGCTGCTTCAGGGCCTTTGCATTTGCTCTTTTCTTTG 

CCCAAAATGCACTTCTCTCACTGTTCACATGATTTTTCTCCCTCTTTTCC 

TTTTAGTCTTTGCTTAAATATCACCTTCTAGGGAGGCCTTCCCACACCAC 

CTCTTCAAGATTTGAGGGTATGCACCCCCACCCCTAGCCTTCTTATCCCT 

CTCCACTGCTTTCTTCTCAAAGCACTTGTTACGTTCAAATAAAATAGATT 

AGTTACTTTATAGTTCTAATTTTACTATTTTTTGTTTACTTCATCAATAC 

CCATGTAATCTCTGGAAGGAACGTTTCTTTTTGTAGTGTATTTCTAGCAC 

CTAGAACAGTACTTGGCACATGGCAGGTGTTCAAAAGTATTTGTTGATTA 

TTTTCTCAAAGGGCATGGAGTCTTAGAAGTTTGAGAACACAGTTCTAAGC 

ACAGCTGTTTAGAGACTATGGATGATGCTAATGGCTGTATTCCCAGTAGG 

TGGGGCAATTCTCAAATTGACCTGGAATCCTTGAGATCTGGGGACAGTCA 

CCAAGCACTGGGCTCTGTGGGGAGAGATGTGCTGGTTTTTAGAGAGGAGA 

ATAGCATCCTGGGGGACTTGGCCCCAGGGCTTTCCTGTCCCAATCTCTTC 

CCAACTGAGTCCCAGAGGCAGGAGGCCTTGTCTGTAGCTGGTCAGTCCTG 

TAACTGTTTCCCTCCCATCTACACAGATGCAAAGAAGGCTGAGAAAAGCA 

AGCTGTCAGGTGAGCAGGGGCCCTGACTCCTCCCCAGAAGGCACTCAGAA 

CTTCCATAGGGCAACTGGAAAGAAGGTTCTACTTCCTCACCGGCAGCTGT 

TGCTGGGGAAAAAACCAGCCTCAGGCCCTACCCTGTGCTGAGAACCTGAA 

TCCAGTATCAGGTTCTCCAACAAACTTGGATCCAGCTGACCCTCACAAGG 

GGTCAGATGCAACCTTGTAGCATATGGAAAATGGCAGCAAGGTCCTTGTG 

TGGACTATGCCTAGAATCTAAATTAAGACAAGGCCTCAGAGGGGCTAAGT 

GACATCTGTCTCCAAAGTTTCACAGCTAGTGTGTGACTAAATCTTGATTC 

CACCCTCTCAGGTTTTACCATAATCCCAAAAAAGGTTGAAACAAGAAAAG 

TTATCTTTGGGCAATTACCTCTTTCTGTTCCTTGCTTTACCTACTAATGT 

TCTAGGCTCACCCTCTGGTCTGCAATCTCACTGAACTGACAGATCCCTCA 

TGGCCTAAAGGGTTTTCACACTGGGTTGACTAGGCTCTCCCATTGCCTGT 

CCTACTGTCTAAGGCACCTCCTGGGTAGGGTGCCCAGCGTCATTCTGATG 

CTGCCTGACTTTCCTTCCAGCTACTTTTGAAACTTGGTATCCATGGCAGA

ggcttaaagggcatgttl jaggtacttttatttccaaattccccAIgtggc

atcaaggaaatcagcatctctggatagctctactaaggcttagttctcat 

tgtccaatctagctcctgggtcatgggaggcattcaggaaatatttgagt 

gtaagagtgagttgctttacctccagaatatccttccaatggctctgaag 

caggctgtggagtcctgctggctgatcacagttcacaggtggctcccaaa

cctgtggtctacatccatcctttgtcagtgtcactgccattgtcccacaa 

atc5.tcatttgggcctagcccctgggatagtaatcagtctttacatagata 

tacattgtgctttacatccacagtaattctgagtggaccttaaaataaat

tccatgtcaggtctcaccagcccatgggttacagatggggttacctttca 

gccttgtaaggtgccccgtctttgagtgtagacatggactcacaacgagt 

ccactcctgctgttcctctgctcttgctgaggcttctgctgctgctgctg 

ctgctttgcagaggctggccagctgtggtgcctgaggcacctgtgtcttc 

acagcaccaacttgcatggtggccacggtgtagttggaaagggatgctta 

gatgggaggccaatgggagctgcttcaggaggcaaatccaagtcacagag 

atcgagtcaccgagagcatagtaaactcaaaatcccttcttctgcttaat 

aactgagatgctgtcactgggttaacctcaccaagccttgttttgtcttc 

acttagagtgatttctgtcttagaaggctcctcatatccttctggggaag 

gcttctagtgagtccacagatagctggaccaggcatgtccagaaataatc 

tgattctcacatttgagttagccagcgttcccagctatatccccattttg 

tgtctatataagttaccaaagcccacaaggatattaggtggctccttagt

ttgctttatgattatgccttgtgtgtgtgtgtgtgtgagtgtgtacgcct 

atgaggattccttctctcccgttcttgctatggcttctcttccccactga 

tgggctgtagttccctgtccttttgactttgggcttagtcatgtgacttt 

tttgccaagggaatgtgggcagaagtaactgggagccagtcccaagctaa 

ggccttgggaagcatggtgagcctatgccagctccctcagaactccttcc 

cttggccatgaagagagaataacctggattgtaccttcagcccatgtcct 

agaatacaaacatggagaataatgaacttgactcaaaggctgaagggcag 

ctgagcccacatgaggtcaattgaactgcagctacctacagacctgaaag 

tgaaataaacatgtataagtctctgacgtttggggtttgtttacatagca 

ttattgtagcagaaacttaaataatactgggggctaaatatagtggacca 

gtgacagcacagaatggtaaaatggagtgattgttacttacatcacaacc 

cttcatctctgttgatggacactaaaatcaaagtggcaattactcagagt 

tgggagtcattgagttgcatcattgttgtttagaatcattgacagtttga 

gctctaagtgattacagagatggtttcctcagctacaggtaaataaacaa 

aggcacagagaagtaaagtgacttctagagggcttcattgatatttagca 

GCAGAATCAGAGCTAAACAATGAGTCTCTCATCTCCAGCCTTTCTATTCT 
TGTTTCCTAGGTTGGGATTTTGGGAAATAGTGCAGAGAGATTAGCAGTAG 
TGACATGGAACAATGTGAGCCTCAGCTTCCATCCCTGAGGCTGCCTTCAT 
CTGCCAGGGAAATGTCTCTGTGTGCAGCCTTGCCCTCTGCACACAGTGTG 
TATGGCCACCTGAATAAGTGTCCTTTCATAGCGACTAATGGATTGAAATG 
GGTGCTAGAGCAGTGCTTCTAAAAACTCCATGTATTAATCATCTAGGGGT 
CTTACCAAAAACGCATGCAGATTCTGATTCAGTAGGTCTGGAGTGGGGCT 
TGACATTCTGCACTTGTAACACATGGACCACACTTTGAGTAGCAATGTAT 
TAGATCATTCCAGTGGAAACATGTATGAGTGATGGAATGAACAGATATAA 
TTAATCCAGGTCTGGTAAGTGAGGTACTGATACATATTAAGTTGAAGTGA 
AT^CACATCAAAAATAATGGTTACACAGTGACTTTTACTGCCCCCAAAT 
TCTTTCCTTTTGAGTGGTTTCAAAGTGAACTGAGCCAGCCAGGTTAAGTC 
CCTGGTTTAGTGTGTGATTAGAAGATTTGATCCAGCTTTCTCCTCCTTCT
AATTCTTTAAATATGCAATGGCCTTCTAGAAACTTGTCTCTCAGGCTCCC 
CATGAGCCACCTGTCTTAATATCTTCCCCCCCAGGACATTTCCTGGGTCA 
AGGAAGGAATCAGGGACTAGGAAAAGTAGAAAGGTTGCCTGACAGTGAGA
AACTTTTTGCACTCCTATTTGTTCAATTCTAAAATGTGGGTATTGTTGGG 
GCTTCTAATTGGAATCTAACCTGAAATTCAGGCATGTCTAGCTATATATG 
ACCAAGAATTAGGATGAGTTCACTAGAAGCCTATTTTCAGGAGAGCGGTC 
AGTTAAATTGAAGTTTATGGGTTTATGGTAATGGGTTGGGGAGTTTACTT 
CATTAGCAATAGCAACGTTTTTGAATCAGAGAAGTGATTTTGAACACACT 
GTACATAGTTTTCTCACTTAGATTTATCTCTGGGTCAACCCTTGTTGGAC 
CTATATTAGAATCATTTAGTGAAGAAAAGGTGGGTGTCATTAGGAAAAGA 
GCCATTTATTCAAATGTTCTGTTTGACATTAGGGCACTGGCAAGACTACA 
GAATCAATAGATATTTAAAAACAGCCAGGTGCGGTGGCTCACGCCTGTAA
TGAGCTCAGGAATTCAAGACCAGCCTGGTCAACACGGTGAAACCCTATCT

CTACTAAAATACAAAAAATTAGCCGGGCATGGTGGCAGGCGCCTATAATC 

CCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCG 

GATGTTGTCATGAGCTGAGATCGCGCCATTGCACTCAAGCCAGGGCAAGA 

ATAACAAGACTCTGTCTCACAACAAACAAGCGAACATACGAAACAAACGT 

AACATCCAAACTAGCAGGTACATGCCGTGCCAGTCATGACCCATGGTCAT 

AAGATGTCTACAGCTCAGGAAGCAGCTGCACAATGCCTGCATAGACAAAC 

TCTTATGAAAGCAGAATGTCCTGATGTCTCCATAACACATAACAGTGTAT 

GCTTTTATTATGGTCATACTCTAGCTGTGATGTACCTACGCTCTAATATG 

CCAACGATAGTTTTCTTTAAATCATCAACATAATAAATGTCATGCTGTCA 

GTCCCCCACATGTAGACATAACTTAGCTGGTACATGGATAAGAAACCTAT 

ATTAGATAACCTTAGGCCAGGTGTGGTGGCTCATGCCTGTAATCCCAGCA 

CTTTGGGGAGGCCGAAGCGGGTGGATCACGAGGTCAGGAGATCGAGACCA 

CCCTGGCTAACACAGTGAAACCCCGTCTCTACTAAAAATACAAAAAAAAA 

TTAACCGGGCATGGTGGCAGGCACCTGTGGTCCCAGCTACTCAGGAAGCT 

GAGGCGGGAGAATGGCGTGAACCCAGGAGGCGGAGGTTGCAGTAAGCCGA 

GATCACACCACTGCACTCCAGCCTGGGGGACAGAGCGCAAGATTTCGTCT 

CCCAACCCAAAAANCNANNNNAAATTTGCACCCAAATCTGACTAATTCCA 

GAGCCAATTCCAATTTAGAATCGTTATATCTCCCTGGTGAACTGAAGCTT 

TTATCTTTAAGGAGACACACTCTTTATGTCTACCAATGCTtATTGCCTTA 

AAGTCCACTTTGTCAGATACAGCTGCTTTCTTTTAATTAGTTTTTGTGTG 

GTATATCTCTTTCCATCCTTTTTCTTTCAGCCTTCTCCATTCTTACATTT 

TAGATATATTTCTTTTTTCTTTTTTTTTTGAGAGAGAGTCTCACTCTCTC 

GCCCAGGCTGGAGTAGTGCAATGGCGCGATCTTAGCTCACTGCAACCTCC

ACCTCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGG 

ATTACAGGAGCCCACCACCAAGCCCAGCTAATTTGTTGTATTTTTAGAAG 

AGATGAGGTTTCGCCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCA 

GGTCATCCACCCACCTCGGCTTTCCCAAAGTGTTGGTATTACAGGCGCGA 

GCCACCATGCCCAGCTGATTTTAGCTGTATCTCAAAAACAGCATGGGTTC 

TGTTTGCTTTCCTTATTCAGCTTTATAATGTAAATCATTTACATCAAACA 

TCTAATACACCATGGACTGTAAAACACAGCCATATTTTATGTATGAATTA 

AAAAAAAAAACACCACCAATTAGTTCCTGAGACACACACCTTAACAATAT 

CTCTGTGATGTGCATAAATCAATCACATCAGTTTCTCTGCACCTCAAAAT 

TTCTTTCCTCAATTCTCAGAGATATGGCAATTTCTCTGGTTTTACATTCC 

CAGAAGCAAAGAAAAAGTACACAGCTTCTTCAAGTCATGAGTAGCTTCTT 

TTTTATAGCTCTTGGTGTTTGCAAAAAAGATTGGAATTGCTTCACTAATA 

CTAAATTTTCATTCTGCTGCTCTGTTTCTATGACAAGTCAGAGGGCATCT 

TTTTGAAGACATTCTAAACAGCAATTAAACTCAAAACATGTAATGACAAT 

GACACACAAAACTCAACTGATGACCAAATGAAGAGTTCCAGCCAAGTTGA 

CACAAGCTGGCTGACAGAGCTTGTAATACACACAGCTTGGCATATGCCTC 

GCCATTTCAGAGATGTAAAAATAGGAATAAATGTTTTCCCTTAAATCAAT 

GAAATAGAGCATTTGGACTGAAAATCTACGACAGTTATAGTGTTTTCTAT 

TC^TTATTCTCATTCTGTTTCTTCTCCCCCTTGCTTTCTTTTAGTTTGAA 

TATTTTCTATCATTTCATTTTTCTTCCTACTAGTTTGAAACTTATGCATT

TATTTTCTATTTTTTAGCACTTACCTAAAATTACTCTGTAATCCATGGAT 

CCTTAATTTATTTAAAAAACTAATGTTAATGAGTAGCTTTATTTTCCTCC 

CATCTAATTTAAGGCCCACAGAACACCTTCACTTACCTCAATCCTCTCCC 

AACTTACATGCTTTTAATGTCATATATGTTAATACCGTATACTTTTAAAA 

CTTTCTAAAATAGCATTATTTTATAGCATGAGTGTTCATTTACATTTTTG 

CATATATTTAGAATTTTCTTTGCTCTTCGTTTCTTCTTCTATTTATGACT 

CCCCTCTGGGATCATTTTCCTTCTACTTGAAGTACATAGTTTAGAACTGC 

ACTATTCAATACAGTAGCCACTAGCCATGTGTAGCTATTGAAGTTTAAAC 

TAAGTAAAATTGAGTAATATTAAAAACTCAGTTCCTTCATCTCACTAGCC 

ACATTTCAAGTGCTCAGCAGCCACATGTGACTAATGACTACTGTACAGCA 

AACATATAGAACATTTCCATCATGGCAAAGAGCTCTATTGATAGTGTTCA 

TCCAGAGTTTCTGTTCCAGGACCAAACTGAGGGTTGGGCTGCTATTTCTC 

ATGGCCCAATAACAAGATGCAGATGAGCTGGGGAGGAAGAGAGTTTTTAT 

TTCTGCAACCAGTTACAGGGAGAAGGCCTGGAAATCATCACCAGGCCAAC 

TCAAAATTATGACGTTTTCCAGAGCTTATATACCTTCTAAGCTATATGTC

T ACGTGTAAGTGTG CAT! vJACCTGAAGACGTAAGTGATTAACTTCTftl A -

ATCTGTAACTAAGGTCTGAGTCCGGAAGATCTTCCCCTGGAGCCTCAGTA 

AATTTACTTAATCTAAATGGGTCCAGGTGCTGGGGTAATTACCCTTATCT 

TGTCCCCTGCTAAATCATGGAGGTTTGGGGAATTCCCTTTAGAGCACCAT

TAACCTGTTTGTTGAAGGCCTGGGAATTTCCTCCAAACCCCCATTAAACC 

TGTTTAATCCCAAATTGGTTCCGTTAAAAATTCCCTCCTTAATTTGTCCA 

ATTTTAAAGGCCCAAAAAAGGCTGGGGCAAACTCCTGAATGGCCTTTGTT 

ACATTCCAACCTTTGTTTAAAAACACCGGTTTTTAATATTTAACTTAACC 

ATTTAATCTCTACTGAAACACTTGTTATATAAATCTGCATTAATGAGAAC 

TGGCCTGCGCCATATCTCCTTCTCAGAATATCTTAGGGTTGTGATCCCCT

GTGTGAAGAGAATATATCTCTGGAGATCTCAATCTCTCTACCCCAAAAAA 

AATCTCACTCGGAGAAAACTCAGACTCTTATCTCCACAGCGCTATCTCTC 

TCCTCTCC 

>Contig50 

GCTTGTCTAAGATGGTGCTCCTTGTTGCTGTGCCTGCTTTCATCCTGGGA 

TCTCCCTTCACCATCAGGATTGCCTTCACCTCATTCCAGTCTTGGATCTT 

TCTTCTTGTTTCTTGAGTATTTTTTTTTTTTTTTTGCTGCATTCCCTTCA 

GTGGCCTCTTGGGAAAAGATGTGTAGGGAGAAAAATTTTCTTTAGAAACT 

TGCATATCTGACAATATATTTATCCTATCCTGACATTTGGTAGATAGTTC 

AGCTGGGTACAGAATTCTAATTAATTTTCCTTCCTGATTTATAAGACATT 

GCTCCATTTTCTTCTGGCTTCCAATATTGCTGCTGAGAAGTCTGACACCA 

TTCAAATGCCTGATTTTTTCCATGTGATTGTTGTTTTCTGTCTGGAGTGT 

TGTAGGATTGCCTCTTTATCTACAGTGTTCTGAAATTTCATGACGTAGGT

CTTTCTTCATTCATTATGGTAGACACTCAGTGGGCCATTTAATCGGGAAA 

AACATGTGTTCTTCAAGTTCTACAAACTTTATTACTTCCTTTTTCTTGTG 

TCTTTCTCTGGTCTGTTTTCAGCCCCGAGTCTCTTAGATCTGTCCTCTAA 

TATTCCTATTGACTTTACTTCATTTTCTAAGTCTTTATCCTTTTGCTTTA 

CTTTCCGAGAGACCTGCTTAACCTTATCTCCCAACTCTTTTATTGAATTT 

CATTTCTTTTACTATATATTTTTTACTTTGAATACACCTCTCTCTTCCTC 

ACATTTTCCCCCATAGTATTTTGTCTTCAATTGACAGTTCTACTATCTTA 

TTACTCTGGAGATATTAATAATAGTTTTTAAATTTTTATTTATTTTTATT 

TTCAAAACAGTGTCTTACTCTGTCACTCAGGCTGGAGTGCAGTGGTGTGA 

TCATGGATCACTGCAGCCTTGATCTCTGAGCTCAAGCTATCCTCCTGCTT 

CAGCCTCCCAAGTAGCTGGAACCACAGGCATGTGTCACCATACCCAGCTA 

ATTTTTTTGTTTTTGAGGTGGAGTCTCACTCTGTAGCCCGGTCTGGAGTG 

CAGTGGTGCAATCTGGGCTCACAGCAACCTCTGCCTCCTGGGTCCTGGTT 

CAAGCAATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGAAACA 

CACTACCATGCCCAGCTAATTTTTGTATTTTTGTAGAGACAGGTTTCACC 

ATGTTGGGCAGCCTGGGTCTGAACTCCTGACTTGTGATCTGCCCACTTGG 

GCTCCCCAAAGTGTTGGGATTACAGGCGTGAGCCACTGCACCCGGCCACT 

AATTTTTAAATTGTTAATAAAGACGAGGTCTTGCTATGTTGCCCAGTATG 

GTCTTGAACTCGTGGGCTTAAGTAATCTTCTGCCTCAGCCTCCCAAAGTG 

TTGGGATTACAGGTGTGAGCCACTGAATCTGACATTTTTTAAAAGTTTTC 

TTCTCTTTACCAAGTCTTTTTTCCCCTTTCTGCTTTTTTGGGTTGTTTTA 

TTTTGATCTCTATCTTGCTAGAAACTTTCTGGAGACGTTTAGTAATACTA 

GATTTTTGAGAGTGGGCAACTGGAAAGCTGATTGGAAACTCTGAATACAT 

GGGTGAGGCTTGTTGGCTGTGAGTGTCATTGCTTGATGTCCTGGCAAGGC 

CAATGGGTTTGGGACCCCTACTATTAGTATAGGCCTGATTCCCTGGGAAA 

GGCTCTTTTGATCTCCTGCCTGGAGGATAAAGGCCTGGCTACCAGCCTTC 

TGTGTGTAATGTGAGGGAGAAGGGCTGGAGTATTCAACATCATGCTGAAT 

CCTTTCAATGATCATCTTGTTTTTAGTAATCTCCTACCTTAACTCTCTGT 

CTTCTGCTAGTATGGGAAAGATGACCTGAAAATCTAACCATTTATTTTTC 

CCCCATTAATATCATTTTATGATTATTCAGAAGTTAAATAATTGTCATGC 

TGTCCTCCAAAAAGACTGAATCAACTAGCAACAAATAAGAATTTTCTCAC 

AGCTCTGCCAGCATTTTAAAAGAATAGCTTTATTGAGCCCAGGAGGTCAA 

GGCTGCAGTGAGCTGTGATTACACCACTCTACCCCAGCCTGGGTGACAGA 

GCAAAACCCTGTCTCAAAAAAGAAATTTAAGGAACAGCTTTATTGTTGTA 

AAATAGACATACAATAAACAGAGCACATATTTAAATTGTGCAACTTATAC 

TTTGATATAACCCTGTGAAAACATCACCACAATCAAGATAGTGAATATAT 

TTATCACCTCCTGATACAGTTTAGCTCTGTGTCCCCACCTAAGTCTCATG

TTGAATTGTAATCCCCAATGCTGGGGGAGGGGCTTTGTGGGAGGTGATTG

AATTGTGGGGGTGCACTTCCCCCTTGCTGTTCTTGAGATAGTGAATGAGC 

TCTCATGAGCTCCCCTTCACTCACTCTCTTTCCTGCTGCCATGTGAGGAT 

GTGCTTGCCTCTTCTTTGCCCTTCTGCCATGATGTGTTTCCTGAGTCCTC 

CCTAACCATGCCTCCTGTACAGCTTGCAGAACTGTGAGTCAGTTAAATCT 

CTTTTCTTCATAAATTACCCAGTCTCAGGTGGCTCTTTATAGCAGTGTGA 

AA&GGAACTAATATACCTCCTAAGTTACCTCAAGCTTCTTCTTAATTCCT 

TCTCCTCCCTTCCTTCATTGCCAAGCAAACAACCACCTGTTTTCTGTCAC 

TATAGATTAGTTTACATTTTGTGGGTTTTTTTTTTTTTTGAGACAAGGTC 

TCACTCTGTTGCCCAGGATGGAGTGCAGTGGTGCGATCATAGCTCATTGC 

AGCCTTGAACTCCTAGTTTCAAGTGGTCCTCCCACTTCAGCCTCCTGAGT 

ACCTGGGACTACAGGGGTACACCACCACAACTGGCTTAAAAAATTTTTTA 

AATAAAAATGGGGTCTTGTTATGTTTCTCAGGCTGGTCTCGAACTCCTCG 

CCTCAAGCAGCCCTCCCTCCTTGGCCTCCCAAATTGTTGGGATTACAGGC 

ATGAGTCATGACTCCTGGCCTAGTTTACATTTTCTAGAGTTTTGTATAAA 

TGGAAACATACAGAATGTATTTTTTTGCGGAGTGGGGGAGTGTTTCTATT 

TCTTTCTTTCTTTTTTCTTTTTTTTTTTTTTTTTTTGAGACGGAGTCTCG 

CTCTGTCTGTTGCCCAGGCTGGAGTGCAGTGGTGCGATCTCGGCTCACCG 

CAAGCTCCACCTCCCGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGAG 

TAGCTGGGACTACAGGCGCCCGCCACCACACCTGGGTAATTTTTTTTGTA 

TTTTTGGTAGAGACGGGGTTTCACCATGTTAGCCAGGATGGTCTCdATCT 

CCTGACCTCGTGATCTGCCCGCTTCGGCCTCCCTAAGTGCTGGGATTACA 

GGCGTGAGCCACCGTGCCCGGCCCAAGTGTTTCTATTTCTTAACCAGCTT 

TCATGCAATCTTTTTTTATTTTACCATCTCTGTGATCCCACTCCCAAAGG 

TACTAGATGTCGATTGGTCCTTAGGATCAGCTACCATTTGCCCAACTGCT 

TTCCAGCCTTCCAAAAATTTTTTTCTTTTTTTCTTAAAGATACTCCTGTG 

TGAGGCTCAGAACTCTTGAATTGCTACTGCAAATATGAACTCGGTGATGT 

GAATGCCAGGGAATTGCCTGATTGATCAAAGAAATGTATCCCCTTCTCCC 

TCACTCTTGCTGTCTTCTCATTTGTTTTCCCCATCCTTGTGGATTCGTGA

ATTTAAATATCCCTTTAATGTTATAATATTTTAATGGCGTTTGGCGAAAA 

GTACAGAATTAGGTGCAAGAGTGCATAGCTGTTATTTTTTTTTTGGCCTC 

TGAGACTGTTCATATATGCAAGTTATTTAACAGAAAGTTCTGCAGTGACC 

TGAGATGTCAGGGGGGTCTGATAGAGTACGTTTGAAGGCAGTTACTGGAA 

AAAAATAATGCCATTTCTGGTTTGTACTTCGGTAAGTTCAGATGACCCAA 

TATATTGTTTACATGTGGCATTCAGTAAAAAAGTAGCTTCCCCTCCCTTT 

CTTCTTCCTTTTCTCCTTTCCTGCTTCTATAAAGCATCTGCTTTGGGAAA 

CTTCTTAGGAGGAGAGCTTGCCAGCCCGTGGGTAATGGAGAGGTCTTGCA 

GAGATAAAAGAGATGCTCCCACTCAATGCAGGATGGTGTGGAGGTAAATG 

GGGATACGTCTGGCATCACTCAGGAATGGGCCTTCCTGGCAGGGAAGAGA

AGGGAGGGGAAAGAGGAAGGGAGTCAAAGATGAATTGCTGAATACGGGGA 

TTCCAGGGCCTGGAGCCAGGAAGAGAACTTTGGGAGGTGTGAACCTGGAG 

GGCATCAGCTGATGAGGAGCAGCCTGAAGTCCGGGGAGGACCTGTTTTTG 

GTGGCCAGGAAGAAAGTGCCTTCCACACACAGGGAGGCCACAAGGCTGAT

GGGCTGGGGGTTGGAAGGACAGCCCTAGGACAGGCTTGGGAAGCAGGCTC 

AGGTAGGGACTGCGAGGTTCTTGTTGAGTCTTTTTCATTCCTGGTCTTAG 

AAAATAGAATCCAAGGCCTCTTGAGAGTGGAAGGTGGGTTGGGAGGAGGG 

CAGATGGGGCTTAGGCCCAGGACACCCGTAGAGCTACTGCCCAGCTGTCT 

CTCAGGGACTCTGCTGAGGTCACTCCAAGGATCATTCTTAGCCTTGCTAG 

ACAGTACTGACAGAGGGAACCGTAGTATCGCACCCACTTCCTTCTCTTTC 

AATGAAAGTTTAAAGGTCACCATTTCCTCTGGCAAAGGAAGTTCCACAAA 

TATTCCATTTCCGGTCTTAGAAACAGCAAGGTATCAAGCAATTGCAAACT 

TCCTGTGCTGGGGAATTCCCAAGGAAGTAGGGGCAGAGTTCTGGTGGAGA 

CAAAGTGAATTCCGAGTGATTAGTCAGTAGCAGTAGCAGTAGCAGTAGCA 

GTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGC 

AGCAGAACCAGAATTTCCCCGCACGTGTCTCAGGCTCTCATTTGCCAACT 

CAGTCTCTAAGTATTTTTATTGGCAGGAAAAATAAAATAGCTATGAGTGA 

AATAATTCATTAGACCTGAGCCTCCATCAATTTTGTGTTTAAAGGCCTGA 

CTCTCTTTACCTTTCCCTGGGATGGAAGATGCAAATGTTCCTGATGTCAC 

TGTCAAAAAAGAAGAACCAGTGGGTATATTGTATGCTTGAGTTCCAGCCA 

TTTGTCACAATAGATAGAGATGACTGCCATGTGTGTAGACTTTCTATAGA

CTGTGTGCTAAACCCGACCTGCCACTTCCAAGGAGTAGATGAGGAATG x C

CATGGTTCTGGGGAGCCCTACCCCAATTTGGGGCAGACATTCCAAAGCTC 

ATTTTCTGTGGAGGGGGTTGATGGTTAAAGGACGGCCTGGGAGTAACTCG 

TCTGTACTAGGGCCCAGGAGAGTTACATGCTGCTTCCCATGTTATTCATC 

ATTCCCCCATGTGAATAGCTATGGCGTGAGGTCCAAGGTTAGGGCCTTTC 

TACCATAAATGGGGGAATAAAATTCCCCTACCAGCCTGAGAAGTTTCTGT 

TAJAAAGAGGCTTTTTT.TTTGCGGGGGTGGGGGAGCAAGCCGACTAATGT 

GTTATTTCCATACGGTTTGTTTTAAAATGTAGATGTCATATGCAGGAGAG 

GTGGTGTAGTGAGTCACAACGGGATTAGAAGGACCAGTCCGAAAAGCAGA 

AGAGGGTCAAGTTCAGGGCACTGAGGACTACTGCATTCAGTGGCGTGAAA 

GGCAGATGGCTGAACAGGAGGGGGACATTACATTGCTTGTTCTCCTTGAG 

CCTCGATTTCCTCATCTAAAAAGAGGGTCATTTATTCACAGAACATTTAT 

TAAACTTGTGCCAGGCACCGTGCCAGGAGCTGGACTAAAAATTAAATCCA 

CCCCTGTGAGCTGCTCTGAAGGCTAAAATATGAAGTATGTAAAAGTAACC 

AAGTGCTGTACACATGCAGCTATTCAATGACTGTGTGGGCATTGCGGCAG 

ATTTTAATTTTCTTTTTTATTTCTTTCTCTTTAGTGAGAGGIGTTGGTTG 

TTATTATTGTCGTCGCTGTAACTGTCTATTTCACTTGCTTTTTTGTTGCC 

TCCAGCCCATTCCAGGGCTGTCATCTAAGACACTTCTTATCACCTAAATA 

ACCGGGGAGGCAAAGCGCTTTCTTAAGAGATGGATCCAGAAGAACAATGC 

TGGTTTTCTGTAGAAAAAQSGGCTGT-GGGAAGTAGAGATAAGAAGGGAAT 

TGGCCAAGATGAATGTACAGAGCCTTATTTTTTTTTTATAACACAGCAAG 

ATTAGATACAAAACAGGACAATAGCATCATCTGTTTTTATAACTGGAAAG 

GACCTCACTTTACAGGTGGGGAAGAATAGAGTGGAGAAGTGAAGAGAATG 

GTCACAGAGTCAATCAGCATGTCTGCGTCAAAGCTGGGATTCCCAATTCA 

GGGCTCTTACTACAGTGACGTATGGCTAATATTTTGGCATTGTTTCGGGG 

AAAAGCTGAAGCCCTGATGGTGTACGTCACTCTTGAGATAGTCTGTAGTC 

CAGCAGGGAGGAAAGCAAGGAAGGGAGGTGGAGGCAGCATTTTTGGGTGT 

AACATTTCGTTCTTGTTTTGTGGCCAAATCATAGTGTGATTGGGACAAGC 

CACTGCCTTTCTCTGAGCCTCCACTTTCTTTTTCTTCTTAAGAGGGAGGG 

AATAGTAGAGTAAAAGTAGTCATTTTATCAAACACCTGCTATTTTGGAGC 

CATATTGCAAGTGGGTTGGGGGTTGAACACTTGGCTTTATTACCCATAGG 

ATTAAATCCAACCTCGATACTGTGGCATTCCCAAACTCCAGTCTAATCTT 

CTTCTCCATCAGCCATGCCCCACGACACCCTGGTCATATCTGATGTTGCC 

CCTTGCACTTGCCCCCTCCTTATCTTTGCTTTCTGACCTACCATATGGCT 

ATTGGTTGAAATTCTCATTTTCCAGGGCCTTGCTTAAATATCATCTCATC 

CATTAAAACTTTCTTGAACCTCCCCTTGCCCTGTTCCTCCCTAATGTCTC 

AAGCCAGAATTTATTTCCTTTTGTGGCCAAGGGACTGGGTTTGTGACCTC 

TCTCACGAGACTTAATATTGAGACCAAACGTCTTTAGACCTCACCAGCCA 

3AGAGATGAGCATCTATGGAATGCAGGCTTTTGCCTGGACTTGCTGATGC 

AGGGCCTCTGCCTTCCTCCAGGGCCTCTCCTGCTGTTTTAGGAATTTCCC 

TCATGGCACAGTCCATGAGCTCAGGGTCAAGTTCATACATGTTTTTACTT 

CTTCTACTCTGCAAATGGTCTTCTTGAACTCTGAGGGTCCTAAAGCTGCT 

CTGCAGTTTGTGGGGTGAGTAGAAAGGGGCTTTCAAAAGTTGTGCTGTTG 

TTTCCCACCCCAATAGCATGAAACACAAAGATGCTTACAAATAGCTGCCT 

TGCTTTCTAGTCCCAACTTCTCTCTCCTGAGGCTTTAAAACAAGTCCCCT 

AGGTTGAGCTGGACTGGAGTTGTATCCTATCTTCATTATCTGTCTACTCT 

CTTTCTGCTCTCTAGAGAAGATATTATATATGTGTGTATGTATGTGTAAA 

TATATAATATCCATATATAGAACATATATTGTTATATTTACATATACATA 

CATAACATATGCATGTATTCATATATACATATGTAGTATCAAAGTTGGAA 

TTAAAGTGTATATTTTGTAATTTGCTTTTATTTGCATCTATCACTGTAAA 

TTTGAAACAGGGTCTTGCTTTGTTGCCCAGGCTGGAGTGCAATGACCCGA 
TCTTGGGTCACTGCAGCCTTGACCTCCCCGGCTCAAGTGATCTTCCCACC 
TTAGCCCTCTGAGTAGCTGGGACTAAAGGTGTGTGCCTCCACACCCAGCT 
TTTTAATTTTTTTTGTATTTTTTTTTTAAAGACAGGGTTTTGCCACATTG 
CCCAAGCTGGTCTTGAGCTCCTGGGTCCAAGCAATCCTCCCACTTTGGCC 
TCCCAAAGTGCTAAGATTACAAGCATGAGCCACCACACCTGGCCTCAATG 
TAATTTTTAATGGCTGTATAGTATTCCATCATGTGGTTGTACCCAAAATT 



TGTTTTAGTAAATACCCACAAAATATGTACAATGGCTGGGCTTAGTGGCT
CACCCCTGTAATCCCAAvJ^CTTTGGGAGTCTGAGGCAGGTGGGf^CCt'S
AGGTCAGGAGTTCGAGACCATCTTGGTTAACATGGTGAAACCCCGTCTCT 
ACCZAAAAATACAAAAATTAGCCGGGTGTGGTGGCACACACCTGTAATCGC 
AGCTACTTGGGAGGCTGAAGTAGGAAAATCACTTGAACCTAGGAGGCGGA
GGTTGCAGTGAGCCGAGATCACACTACTGTACTCCAGCATGGGCAACAGT
GAGACTCCATCTCAAAAAAAAAAAAAAAAAAAAAAAAGTACAATTTGTTG 

TAAGTAAAATTTTGAATCAAGGGAGAAGCACCTGGAGTCCTTCAGATACC 

TATTGCCAAACTGAACTTTTCTGTTCCAGGTTTACTACATTCAGCCTGAC 

TCAGGGTTTGGGGAGTAGAGGAGGGGGTGGAGGCAGAGGGCCTCTCCCTG 

TCCCCACAGACCTCCCTTGGTGAGGTCCAAGTCTGGACAGGTGGAGTGTG 

GCATTGCACCGTCAGGTCCTGCTTCCTGTAATTCCCCTAAATCCATCCAG 

TGGAGCCTCATTGTTCAAGTCTTTTTTTTTTTTTTTTTTTTTTAACTCCC 

CTGAAGACGGAGTCTCACTCTGTCGCCCAGGCTGGAGTGCAGTGGCACGA 

TCTTGACTCATTTCAACCTCTGCCTCCCAGGTTCAAGTAATTCTCCTGCC 

TCAGCCTCCTGAGTAGCTGGCACTACAGGCGTGTACCATCACGCCCGGCT 

AATTTTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAG 

GCTGGTCTCGAACTCCTAACCTTGTGATCTACCCGCCTCTGCCTCCCAAA 

GTGCTGGGCTTACAGGTGTGAGCCACCAGGCCTGGCCTCAAGTCTATTTT 

TTAACTCCAGGAjGGCCTGGTATTCAGAGGGATTAGGGCTGGCAGAAGGGC 

CTC^AGCTTtCAAGGCCTGGGGAATAGGCTGCAGCCTGGTTCAGGGTAA 

CCCAAGTGATTTTGGTTCCAAAGGGACAGGAAAAAAAGTGATTGATATGG 

AAGTTGTCAAAGTGCAACTGTCAAGACATTAAAAAATGTAACCCTTTTAC 

TAATATACAGTAGACTTGTGTTAAATATTTAACTGATTGTAAAAGGAAAA 

AACCAGACGCAGTTTTCCCTACCATACTGTCACAACACCTCAACACTGAG 

TTCTTCTGTGACCTCTAGTCACCGAAATGCTTGGGGATTTCTCCCACCAC 

TAGTCCTCCAGCAGCCGACACCAGTTGGGTGTCCTAATTCACTCCAACAC 

TATCTACCTGGAGTTAGCGTTAGATCCCACAGGTTGAGGGCTCAGTCT'CA 

CAAGACTGCCTCCCACTTCAGGTGCCAGTTACAAGTGGTAGGTTGTCACC 

TATGCTTCTGACTGATGGCTATAAATCTGGGTTTGCTTJCCCTCGGGTTCC 

GTGAATTTGCTAGAGCAGCTCACAGAACTCAGGAAAACACTTAAGTTTAC 

CAGTTTATTCTAAAAGATATTACAAAGGATACAGATGAACACCAGATGAA 

GAGATGCGCAGAGCAAAGCATGTGAGAAGGGGTGTGGAGCTTCCATGCCC 

CTCTGGGGCACCACCCTCCAGGAACCTTCATGTGTCCAGCTATCTGGGAG 

CCCTTCCAAACCCTGTCCTTTTTGGGTTTTTAAGAGTGGCTTTATTACAT 

ACACATGATTGACCGAACCATTGGCCATTGGTGACTGACACAACCTTCAG 

CCCCTCCACTCCCTCCAGTGGTTGGGGAGTGGGGCTAACAGTCTCAAGTC

TCCAATCCTGCCTTGGTCTTTCCTGTGACAAACCCCATCATGAAGCTACT 

GCATTGGGGCTGCCAGCCAGCAGTCATCTATTAGCATGCAAAAGACACTC 

TTATTATTCCAGAGATTCCAAGGGTTTTTAAAAGCTGTATGTCAGGAAAC 

AGGAGATGAAGAACAAATATATATTTCACAACATCACACTCGTTGGGGGA 

ATTGACAGGATAGCAAAACTGATTAAAGGAGGATAGGAGAGACTGAGATA 

TATATTTCCATATATATATATAGAGAGAGAGAGAGATATTTCCATATATA 

TATATAGATCTAGAGAGAGAGAGAGATAGAGAGAGAAGAGTCTTTCC 

>Contig51 

ACACATTTGGGGGAGCAGTTCCGGAGGTACAGCCCGGACAGGAGATGTGA 
GAAGATCGTGGTTANTGTTCCCCTGGTCCAGAACCCCTCCAAGTGGGCTT 
AAGTAGGAAGGGTGGTGAGCGGCAGGTAAACACACGTCAAAGGCAGTCTT 
CCTCTCTGAGGGAAAACACTTGTATAAGCATTGCAATCAATGGGCCTCTT 
TAATTATGTGCCAGTGGCAAGAGCGGGTGCTGAACCCAGGGGCCTGCCTC 
AATCCGGGGCCTTTGAGGCAGAATAAAGTGGTCTCAGGTTGTTGGCATTT 
CCTTGCCCTTCCACCCGAAGCAGACACAAATCCTCTCTGGAGGCAAGTTC 
CCCAATTCAGCCAGTACAACTCCCACAGACTAAGATCAATCATGTACAAG 
CTCACAGACAAAGGTCACCAAACACACAGAGCAATAAACAAATTCATGAG 
TGACGTGAATGAGAATAAACAGAAACAATAACCACCAGCTGGGATGCTCT 
AAGTCTTCAGCTGTTAGAATTCCTGAATATAGAATAAAACTGCCACAATG 
GCAAACATGCATCTAGTACTTACTGTGTGCTGGGTTCTAAGAATTTTGCA 
CATTGTGCCAGATACCGACTCAGCTTCACACTCACCCTCCTACTGTGCCC 
TCTTAATTTGCACTAGATTAAAAGGTAGAAAGGAAGAGGCAGCTATTCTG 
TTCTTGGCTGTGCCTCTGGCAGCACATGCAAAATGGGCAGTAACAGTGGC
GAGACGAGCAAGGTTCCTAAAGGGACAGTGGCCAGTAAATGACCAGGGGC

TACTGGAGTGGCTGCATGGCTCTGTGGAAGCTCAGAGGAGCCTTGGGTCC

TGCAGGTGCAGTAGCAGCTTTCTGTAGTTCCTGATCTCTGGGTCCCACAA 

TCTTCCCCGTTTTTGCTCCTCCACTTCTAATTTTGTAACTGACTTCCCTG 

TGTGTACTTCTCTCTCTGATTGAAATA|3CCAGACTGGT7TCTGTTTCCTG 

ATAAGACATTGTCTGGTACGAACACAGTAACTCATTTAATCCGATATCTC 

TATGAAGGAGGTACAATAATTATTCCTATTTTACAGATGAGGAAACACAG 

CAGAAAAATAAAGTCAATTGTCTAAGGTTGCACATTTAGTCAAGGGAAGG 

GTTGATATAACATATAATTATTTAGAAAACATCTAAGGAAATAAAAGGCA 

TAATTTAAAAATAAAACTAGGCAGGTTTAAAAAAATGAAGTAATCTATAA 

GTAAAAAAGTATAATTGTTGAAATACATATCTTAGTGGATGGGTTAAATA 

GCTGAAGAAATGATTAATGAACTGGAAGGTAGTTCTGAGGAAATCAGAAT 

TCAGCATAGATAGAAAAAATGGGAATTTACAAAAGTACACAGGAATTATA 

AAAGAGGTTAAATTATAGGGAGGGTAGAATGAGAATTAACATTGGTCTAA 

CTGGAATTTTGGAAGAAGAGAATAGAGAGAATGAACAAGGCAATATTTAA 

AGAGGTGGCTGAGAATTTTTCAGAACCAACACAAACTATGACTTTACCAG 

TAGAGAAAACAATGTACACTGAGGAGGATAAATAAATATACTATGAACAA 

ATTGTAATAATAATACTCAACAAAGACAAAGAGAAGATCTTAAAATCAGC 

AAAAAAAGAAAGTCAGACTTAGAAAGAAATGACAATGGCAGACTACTCAA 

CAACAACAATGGAAACCAAATTCAGTGAAACAGTATTTTCAAAAT^CATA

TTTAATCTATCTTTGAAGAATAAGGGTGAAAAGGGTGAAAATTGCTGCCT 

TATACAAAATATCAACATTAACAAAAAGTAATGAAGGTAATATAAAAATG 

TTTTCAAATAAACAAAACTGAGAGAGTTTACCACCAACAAGCATTCATTA 

AATGGACTTTTAAATGCAGTTTTTAGGAAGAAGGAAAACAATTCCTAAGG 

AAGGTCTGAGATGCAAAAAGGAATTATGAACAAAGAAATTGTTAAAATTA

TAGGTGAATTAAAAAAACTGCCTGCATAAATGATAATAATGACAATGATG 

CTATTAATAATGAGTTGATAAGGATAAAGAAAAGGACAGAATTAAAATAC 

TAGAAAACAAGCATGCTGGAAAGGATTCAGGAATTACTTGAAGGTTAAAG 

TTCTAGGGTCCTTCTATCCTTCTAGAGGGGAGTCAATATATTAATTTTTG 

ACCGTCACTTACACAGTGAAAAACTTTAAGGATAACCATAAAAAAATAGA 

AATAGAGAGTATAACTTCTGAAACAGTCAAGGGAAAAATATGGAATAAGA 

AAACTGACCAAAAAACATCTCAGTCAATCAAAAAAAAAAAAAAAAAGAAA 

GAAAAGGTTCGGAAGGAGAAAATCAAAGCATAGAAAAAGCGGGACAAATA 

GAAGTGGAAAAGAAAAAGGTAGAAGAAACAGGTCCAGAAATATCACTGAT

GCACTAAATCACCATTAAAAGATGAAAACAAATGAACAACATCAAAAAAT

TCTAGTGACTGTAGTAGTGCTGATCAGAATAGGCTCTAAGATAAGATGCA

TTATTGTGAGTCAACTTGTGATGATGAAAGGTTTAATTCACCAGAAAGAC

ACAATTATAAACTTGTAATCAAATAGTTTTATTTTATTTACTTTATTTAT

TTATTTTTTTTGAGACAGGATCTTGTTCTGTTGCTCAGGCTGGAGTGCAG

TGGCTTGATCTCAGCTCACTGCAGCCTCCACCTCTTGAGGCTCAAGCTTT

C77CCTGCC77AGCC7CA7GAG7AGC7GGG7CCACAGGCACACACCACCA 

AGCCC7GC7AA77T77G7ATT7T77G7AGAGA7GGGG7TTCACCA7GT7A 

CCAGGC7GG7C7CAAACTCC7GGGC7CAAGCGA7C7GCCCCCC7CGGC77 

CCCAAAG7G77GGGATTA7AGGCG7GAGCCACGG7GCC7GGCC7CAAA7A 

ACTAT77AAG7GAAACAAAACTAG7A7GGCAC7AA7GAAAAATGTATAAA 

7CCA7AA7CGCAGAGGGAT77CAAC77AC77C77TCGATTA7G7AAAGG7 

CAAACAGACAAAAGACAA7GACAAAAC77AA7GCAA7GAACAC7777GA7 

77AA7GAACA7A7A7TGGA7A7G7ACCCAAGAA77AGAGAA7ACA7AC7A 

G7777GAG777A7GCAGAACA77TACAAAAA777AG7GGAAGCC7AAA77 

A7AAAAAG77GC7G7CACG7AGAA7AACACACAAACCCC7GAG7CCGGAA 

77CAAAGCCC7CCACAC7C7CC7C7ACC777GCA7C777A7CC7CCACCA 

CAC7GCAG7GCA7AC7C7GGGC7AC7AC7CAC7G77C77GA77CAAA77C 

CA7G77C7G7CAGCTCAAATCA77C7C7C7GCC7GGAA7AACTAC77CA7 

ACA7A77C7GC7A7TGAA77C77G7C77AGCACCCCA7CTAC7CCAAGAC 

GA7G7CCAG77GGGG77ACTCCC7G7CCCA7777C777GA77ACAC7777 

77777C7AC77CCA77A7A77A77GA7CACA7C7G7GCCACAG77777GA 

C777G7G7C7GC777TAC7C7T77C7AGACCC7GAGAGC7CC7GAAGGG7 

7GGG7CA777C77777TA777GC7CA77CC7CA7GGCACAG7GAG7GC77 

AATAAATGGC7ATTGACTGAAATTAAACTG7ATC7AAATGGACATATTCC 
ACTTCTGGGCCATTCATrCTTTCTTTCTATTGGAACCAGGAGATGbGGAA

CCATAACAAAGGTAAGGTTGTGCCATGTGAAAGAACATGGAACCTTCCCC 

TGAGGGCCAAAAAAGAGCAGGGAAAGGTGCAAAGACAAAATCTTCCATTT 

TTAAACAATGTAAGAATGTGGTCCACCTCATGCTCAGGTGGGACTTTATC

ATGACGTTATTTTTGGGGACTTATAGCTGCATCATTTACCCCATATACAT 

TTACCTTTAGTGTAGGGAACTGAGGACAGGAATTTTGTTGATGCAGACTC 

TTGCTAATGAGGCTAACACTTGGAGAATTTTTATCATGCATTCAAGAAGC 

TTGTTTTACATTTCTTCATTAATACTTTAGTTGGTGGTTTAGCTTTAGTT 

GTAGGCTTATCAGATATTTGGAGATATCTTCATAAACGATGGCTTTGGTT 

TTAGAAGAGTTATTCTGAAGCTACTATTTCTGGCAATAATCAAACAGCAT 

GGCCATTTGTTTTGTAAGGCCTTTCCTAAAATATGACGGTAAAATCTACG 

TGTGGAAAAATGCTTATTCTTCTGTCCTCTATAAATGTGAATCTAGTTTG 

TCTTCAAAATGAAATCAAGTGATTAAAATGTAGTTTTCTAAGAAGATAAA 

TGGAGCAAAGCACTCTGTGTTTCACAGTGTTGGAAATCACTCATCCCTCA 

TAAAACTGTCCCAACTGATCCTGACTCACATGAATGAATTAAAATAAGAG 

TTAATAACATCAATTTACATTTTTAAAGACACTTTCCCATGXTTTAGACT 

ATTGGTTGGAAAAGCTGGTAGGTGTACAATTTGTGGAGAGTTGGCTGTTT 

TTGTCTGTCGTTGTTTGACGTATTTCAAAGCCATATCTAATTTTGTTGCA 

GAATGGTCTGAATTCTACAAAAATGTTGAGTTGTGTAGTGTGGAGAAGTA 

CGGAGCCATTTACTGAAAGGCTGGGGGGAAATGACGAGACCCTGAGATAA

GGCAGTAGTGGTGCGAACAGAGTGGAAGGGAGGTAGTTGAGATATGTTCA 

GAGTAGAATCAGAATGGACATAGTGAACAACTGGATGCAGGTGGGGGCTG 

AGGAAGCAAAGTTGAGGATAATTCTGAGACTTCTAGGTTGATCCACTGAA 

GTTACATTATTCAACACCACAAGGAAACTAGGGGAATGAGAAGGCATACT 

GGTTTGCTTTGGAGTGGAAGGGCAGTGATGTAAGAGGAGTTAATGAGTTA 

AAGTTTGGATATGCCTGAACTTCAATTTGATATGTGCATCTGATATACCC 

TTGGGGTGACCCTCCAGGCAATGGTTGAACATGTGTATTTCTTAGTAACT 

GATAGGCATCACAGACTCACATCAGTAAGGAAGCAACAGCAAACTTGATT 

GGACGATATACCTGGAACTCAGTACCCTATGACTGGAGCAAGTCTCTGTC 

AGTGAAATGAGGATAAGAAGAATCTTGACCTTGTGGAATATGTTGTTAGG 

AATATATGTGATGAACAACATAGGATACTTCCTACAGGGCTCCACATGTA 

GTAAGGGCTTTATAAATGCTTGATAAATATTATTGTTGTAATTTATTTCC

AAAGTAAGATGCCACTGGAGGAATCTTTGGAACCCAAATTAATAACAAAT 

AGGACTGGATGCAATGGCTCACACCTGTAATCCCAGCACTTTGGAAGGCC 

AAGGCAGGAGGATCTCTTGAGCCCAGAAATTCAAGACCAGCCTGGGTGAC 

ACAGGGAGACCTTGTATCTATGAAGAATTAAAAAAAATTAACCAGATGTG 

GTGGTGCACGCCTATAGTCCCTGCTGCTTGAGAGGCTGAGGTGGGAGGAT 

TGCTTGAGCCCATGAGGTTGAGGCTGCAGTGAGCCATAATTGTGCCACCA 

CACTCCAGACTGGGTGACAGAGTGAGACCCTATCTCAAATAAATAAATAA 

ATAAATAAATAAATAAGTACAAACCAGCAAACACTAATCCTTTCTAGAGA 

TTATTGAACTCTGGAGGGCAGATCTGAATGGAGCCAGCAGAGGGACCTAT 

GGAGATCAGCCTGGCCCTGGACAGCACCAGGCAATGGGGTTGCTAGAGAG 

GTAATGGGGTTGAACAGGGTTTAAGCCATGAGGTCTCAAGAATCCGTGAA 

GACTCAGACTAATTTTTTTTTTTTTGCATGAGGATTAGGTGTTCCTAGGA 

ATTTCAATGAGAGCAGGGTTAATGAAGGAATGCAGGGTAGGAGAGCTGAG 

GGAAGGCATCTGAGAGAGCCTGGCTTATGAATGGCTGCGTCAGTATGGCT 

CACCTGCTTTCCTTGTATCTACTTAGCAGATGATCCCACCCCAGGCCTCC 

AGGGCCAAGGTCATTTCCACATAGTCATGGGCCCTTGAGGGCCTGGAGCA 

GTGTAAGGAAGACAGAGTCTTAAGAAATTGCATTAACAGTCATGGTGCTT 

GGCAAGTGTCGTCATCCTATGCCAAGCCTGATCTGAAGGGGTGCATGCTC 

ATAGGTAGCTGCTGCCCAAGATTACAGCAGCTTCTTCAATCCCAGATCCA 

TGCTCTCCTATATTCATTTTTCCAGGGGTTCCTGTCCTTCGACAGTGATG 

AGATGCAGAATGACTTATTGAGTTATTCTCCTGATAGTTGCCAACTTTTC 

CAAATGACAATGGGGCATGGAGCTTGAGAGTGGAAATGAGGCCCTAGGGA 

TAGCGTGCTTAGGAAAACACTCCCAGCCTGATGTAATTCTGGGGGTACAA 

TGGCATTTTCATCATCAAGACTGATGTAAAGGGTGACTAGCAGTGAGTTG 

GGGGTGACTCGCACTGGGGCTAGGTTTCTGATTCTGCCTAATCCAGACAG 

AGCAGAAGCACTAGTGGGCTGGTAGAGGGCCTCCAGGGCCTCACTTAATG 

TCCTGGAAAAACAGCTCCAGATTGTTGGTTCACGTTCTGAGGACAAGCTT 

GGGTACTACAGGATAGAGAGAGTGGTGGGAGATGCCGTGGCCTGCCCTGC 
TGATGCCTGCCCTGCCATTCCTGCGTGTGATGTCTCTGGGGCATCTTGCC

TTCCCTGCCCAGACCTGTAGTTCAGCTGAGGGCATGTGGAGGCCAAATGG 

CTTCTTAGAGTGTTACTTTCCTTGAACAGCTCTGCTGGGAGAACTGGAGG 

AGCTAGCTAGTCACGGTAACTGCAGCAGTCAAAGGATCGTCCCGGTGGAG 

GTGGGGTGGAAAGGTAGAGAAAGAGAACATATAGCGTTTTCCTTGGAGAT 

GTGTGGGCATGTCATAGAGGAAATACCCAATTCCTGAGCCTTGAGCCCTC 

CAGGAAACCTTGGAATATTAGGTTAGTCATCCCCAAGGAAGTCTAAGAAT 

TCTGGTCTCACCCATCTCCTTTAATTCCCACAATGATCCTACATGATATT 

AAGGAACACGGGCCAGTAACCCTCCAAGCAATGGATGTGGTGGTGAAGTT 

TGACCTCATGATGGAGCGGAGGTTGGTTTGAAACCTAAGAATTTAATTTA 

TTGTTTCAAACTGTTCTCCACTCAGCGTTATTAAAGCATACATAATTGAC 

ACATAAAAATTGTATATGTCTACGGTGTACAATGTGATGTTTCGATCTAT 

GTATACATTGTGAAATGATTACAACAAGCTAAATAACATACCCATTCATC 

GTGTTTCAAAGGAATTAAACTCAAGCACAAAAGAGAGGTGCTGTTGAAGA 

GTAGGGCTGCTCTATCTAAGTAGTATGTCTGGGGTTGTCCTGGATCAGGG

TCCTTTTGTGCTAGTAATAAACCAGCCCTTCTGGGGCTGCTC€ACTTTCC 

CCACATTTTCTTCTGGAGCCTCCCTAAGAATTAGGACATGGCCACTTTCT 

CTGCATAGGCTTCCTACTTCAACAAGGACAGGGCTTGTGCTGCCCCATGC 

CACTTGAGTGTCCCTACAGCACAGAGCTGAGTGCACACTGGCTGAGTGAG 

GAAATCCCCCAGATTAATCTTGGTTCTAAGCATCATGGCTGTATTTCACA 

CGTATATGAATTACAAATTACAGCATAGTCGAATAAGGATTTTTGTGCTA 

CAACTGGAATCCCAGATTATGCAAATTGGATAGTATAATATTGAAATTCC 

TAGGACTTTTTATTAGTTTTAAAAAATTATACAAGCTTAGAGTAAGAAAT 

TAAACAGTGCAAAAGAATTCACTGTGAAAAGTAAAATGCTCTGTCTGTGC 

TGAGAGACAGATATTGCAGCCCAGATACTACTGGGGTCAATAGTTTCCTT 

TAAGCATGCCATTTTGATGGTTTATGGGACTTACAGCTCAAGAAGCTTGA 

CACTAGGGTTGATCTCAGAAAATCATTGTTGCAGGTATTAGATATGACCG 

TCTCATAAAGATACACACACAGACACAGCGATTGGAGATATTCACTGGGG 

CTTATGGGCTGCTTGTCCTTTCTGCTCTGTGCCTAAGTTGGGCTCAGAGT 

AGCCTGGCATCGGCTGTGGGGAGAATGCTGGCATGGGGTTAGCAGGAGCC 

CACTTAACATGTCCTAAGCCACCTGGAAGAGTCCTTCAAGGAGACCAGAC 

TCCAGAGGCCCTAAGGAAGGAAGGACTTTTGCCCGTTTTTAGGTATTCTA 

GTCCCAGAGTTTAGGGAGGAATGGTTTGGCTTTGGGTCGTGTGCCCCTTT 

ACCGAGTGGGATGGGATGTGCCCATGAGCTGTTGAGCTGGCTCTTGGAGA 

AGACAGCAAAAGCGGGAATAAGAGGTCAGGAAGCTGTGTGGTTGTAGGAA 

ATCCCAGCAGAGGGCCTGGGGGTCAAAAGTGGTCATGGTAGTGACGGTGG 

AGGCTGAGGTGGTAGAAAATCAGAGGACAAACCCCATGGGCTGCTGGTGA 

TCTGACCGAGCTCCTATGCTCTCCTGGTTCATTTTAGGCTCTGTAGCAGC 

AGATGATTGGCTGGTGTGAGAGCAGTGCACCTGCCATATCAGGCAATCCA 

AGACAAGTCCAAGCTACGCTGGGAGGAAACCTGAAGGCAGCAGCAGGTAG 

ACTGGCTGAAGACAGACAGGCAGGCAACTTGTCAATCAGATTTGTGTTTT 

TAAGGACTTTTAACTGGGGAGCCCTCCATGACAGATCAGATGAGAGAGGA 

ATCTGGGTCCGCCCATGTGTCAAGCTACCAGAGGGTCCCATCGGTGCTTG 

GATCTTCTTTGAAGCTGGGTCTGAGGTTTGCAGGTAGAGGGTGAGCTGGT 

CAGAGGGACCTATTGCAGAGCTAACCAACACCTTCCCAGGAATGCAAGCA 

CAAGCACCCCACCGCGGGCAGGCGGGCAGGCACTTCTCCTTTTGCCACCA 

GGACCTCACAGAGGCTGATCTGGCTCTGTGAGGTGGGAAAATGGGTTGTA 

CTTAGTACATAGAGATAAAAGGCTTAGGAGGCCCCTCCATCCTGTGACCC 

TGTCCCCAGACCACAGGTGCCGGCAGGTGCTGCTATTTCAAGGCTGGGCC 

TCAGTGCAAGCTTGTGGTTTCTTGCCCACCTGTGATGTCCTCCCACTAAT 

GAAGGGGCTCTCCATCCTCTGTCTGCCTCTAGCAAGTGGAGGCTCTGGGC 

CCTGGGCAAGACACAGGGGGAAATGCCATCTGTTATCCAAATATATTTCA 

ATGTGACAGGAAAGCTGTCTTTAGAGCACAGC 

>Contig52

GCATGTGCTCTACATTGATCCCAGGAGTTTGAGACAACATTGCAAGACTG 
GGCAACAAGCAAGACTCTGTCTCTACAAAAAATAAAAAAAATTAGTTGGG 
CATGGTGGTACATGCCTGTGGTCCCAGCTACTCCTAAGTTGAAGAGGGAG 
AATTGCTTGAGGCCAGGAGTTCAAGGCTGCAGTGAGCTATGATCACACCA 
CTGCACTCTANCCTGGGTGACAGAGCAAGACCCTGTCTCTAAAATAATAA 
TCGTAATACATTTTTTTTAAAGTAAAACAAAAAAAGGTCACACTTTCTCA 
TACCAAAATAAATTCCAAATAAATTAAAGGCTTAAACATGAGAAAGTTAA
ACCATAAAATTACTAGAAGAAAATAAAAGCAAATATTTAGATAATCCTGG 
GGATAAATTTCTTTGGAATGAATTTCCTTAAGATGAATCTCTAAAAGTGA 
AATTCAGGGTTCAAAGGTCTTTTCTTTGTCCTTTTCTTTTCCCTTTCCCT
CTCCCTTTTTCTTTCTTTCTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCT
TTCTTTCTTTATCTTTCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTTCC 
TGGTTGCTTGCTTTCTTTCTTTCCTTCCTTCCTTCTTTCTCTCCCTTTCT 
TTCTCTTTCTTTCTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTT 
CTTTCTTTCTTTCTACTTTCTTTCTTCTCTTTCCTTCCTTCCATCTTTCT
TTCTTTCTTTCTTTCTTTTCTTTCTCTCTTTCTCTCTCTCTCTCTCTCTT
TCTTTTTTTTTCTGGTGAGACAGGGTCTCATTCTGTCACTCAGACTGGAG
AACAG7CGCA7GAACA7GGC7CACAGCAGCC77GACC7CC7GGG77CAAG 
CAA77C7CC7GCC7CAG7C7CTCAAG7AGC7GAGACCACAGGCACCCACC 
ACCAAACC7GGC7AATTTT7G7A77777AG7AGAGA7GGGGT7TCACCAC 
A77GGCCAGGC7GGTC77GAAC7CC7GACC7CAGG7GA7C7GCC7GCCT7 
GGCC77C7GAAG7GC7GGGA77ACAGGC7GGGCC7C7ACGCGCGGCCGAG 
AC7ACC7C7C7777AAC7GGATC7C7GAGC7C7GGGCAGAGCCCACCC7G 
AA7CC7GG7C7CCAAAAAGGGAAAA77A77AGGAGGCTAGACCA7A7GA7 



TCTACATGTCTAAACTACAATCTTCCTTGAAAACCCAAGAGTAGCTTCTG

77GCAA7AGC7AG7CAAAAA7A7AA7AG7CAAAAAAA7CAGG7AACACAA 

CACAAACGCAAGCAGTT7AAGAGC7GAAA7GAAC77G7C7G777ACAC7C 

TAGGGA77CCA7AAGGAAAAA7AGAAG777C7CCC7AAAAGGGAGCC7GG 

CACC77C7CCA7777C777AAGGAACCCCAGGC7A77A7AAAC7A7777A 

GGGC7C7CA7GCAGCAGACGG7GCAAGAGAAAGGAGAGACAGCAGAAG7A 

AA7GAAGAAAACAGAA7CCAG7CAACAGAGAAGAAAAAAAC7777GC7CA 

AAAAAAGGCAAG77CC7AGGAAAGAAAAAAAAAACA7GAGGGC7A777AA 

ATACAAAGACGCA7ACA7ACACA7GCACACA7C77GGA7G77AGC7777A 

AT7AAGC7GAC777TAACTA77GAGG7CC777AAAA7AAA7C7777AAAA 

7C77A77ACGATA7T7CAGC7AGGACAAA77GC7GC7A777CAGCA77AC 

CAAG7A7CAAACCAGAAAAGGC77GA777AGGAACCAAACCCAGGC7G7C 

G7GG7AGGAAAAAAGGCAGAACG77AGC7A7GGAACCCACAGCA7GGGGC 

AACAGCCA77GC7CT7TCAG7A7GGCC7GGC7AGCAAAAAGG7GGCC77G 

77A7G7AAA7AAAGCCCGT77GG7GG7CAAAA7GAAACA7C7777CC777 

77777777C7777GC7GGCCG777777CCCCCACCA7ACCACG777G7G7 

GTG7GGGAGGG7GGGAA7T7AGCCAC77CAGAGGCC7CA77CCCCA7AA7 

77GGAAA777CC777GGA777GA7CAAG7CAGA7AGAG7AGG7CAAACCC 

AA7GGGAAAAAGAC7GAAACAGCAA7AAAAACAGAAACAAACAG77AAGC 

AAAA7GAA7GA7CACACAAC77A7A7GA77AC7GAG7GC7C7AA7GG7AA 

GGAGAAA77AAGACCAGC7GG77G77AAAC777AGCCAAGACAAAACCCC 

AA77CAGC7AC77ACC7AGGG77GGG7C7CAGGC7GAAGACCGC7CAC7A 

CCG77C7AGAAGCAAGAAA7AAAAC77GAAC7CG7C77ACC7G7GTAGCA 

GGACAAGCCGCAGACAAAA7CCC7CAGACACCAAA77AAAGAAGGAAGGG 

C777A77GGGCC7GGAGC7GCGGCAAGAC7CACG7CTCCAACAACCGAGC 

7CCCCGAG7G7GCAA77CC7GTCCC777TAAGGGC7CACAAC7C7AAGGC 

GG7CCACA7GAGAGAG7CG7GA7AGA77GAGCAAGCAGGGGG7ATGTGAC 

TGGGGGC7GCA7GCACC7G7AG77AGAA7GGAACAGAACA7GACAGGGA7 

C77CACAG7GCT7T7C77A7GCAAA7AACCGA77AGA7CAGGGG7CGA7C 

777ACCAGGCCCAGGG7GTG7CACCGGGC7G7C7GC77G7GGA7TTCA77 

7C7GCC7777AG77A77AC77C777C777GGAGGCAGAAAT7GGGCA7AA 

GACAA7A7GAGGGG7GG7C7CC7C7C77ACC7GCGGGGAG7GAGC7CAAA 

C7CC77AAAGGAG77ACC7GCC77CCA7CA7CAGGGAAGCAGGAAA7C77 

GCC77CC77G77GGAAGCAAG7AAAAC7CAAAACAAACAAAGAAAAAAAC 

AGGGAG77G7ACAGCAAAA7AAAC7777GA7777GACCAAA77T7GGGAG 

A7CAGGAA77C7C7GAAGGAGA7GC777CAGACC7CAGCAAA77G7CC7G 

77GG777GAGCCATAAAG77AGC7CA7GC7GG7ACCAAACACCAG7AGGA 

GA777G7CAAAGG7AAGAGGCA7C7CCAC7CAGAA7CCC77CG7GG77AC 

CAACA7G7GAACC77GGAAA7C7GAGACAGG7C7CAG77AA777AGAAAG 

777A7777GCCACGG77GAGGACACCCACCCA7GACAGAGCA7CAGGAGG 

7CC7GACCACA7G7GC7CAGGG7GG7C7GAGCACAGC77GG7777ACACA 
TTTTAGGGAGACZATGAGACATCAGTGAATATATGTAAGATGTACRCTGGT
TCCCTCCAGAAAGGCAGAACAACTTGAAGCAGGGAGGGAGCTTCCAGGTC
ACAGGTAGGTGAGAGACAAACAATTGCATTCTTCTGAGTGTCTGATTAGC 
CTTTCCAAAGGAGGCAATCAGATATGCATTTATCACAGTGAGCAGAGGGG 
TGACTTTGAATAGAATGGGAGGCAGGTTTGCCCTAAGCAGTTCCCAGCTT 
GACTTTTCCCTTTAGCTTAGTGATTTGGAGGCCCCAAGATTTATTTTCCT 
TCTACATCACTGTGGGCAGCTGACTAGGAAAGCTTTGTAGGACTGGTGGG 
CAGTGTGAGAGCCCAGTGGGGGGTGGTGGTCCTGTGCCAATGGTAGCAAC 
CACCTGTGAGGCTGAGTAAACTCATTTCCCAACCTCCTCTAGCAGCCCCA 
GTGGAGATACAGATGAAGCAGACTAGCGATACAACCCAGCCTGAAGTTTT 
GTCTGGTGAGTGTAATGGAATAAAAATGGGAAGGGTGCTGAAGAGACCAG 
CAAGAAAATGGTTGAAGAGATGGGGCACAGAAATTAAGCTGGATCAAAAA 
GGACGGAAAAGCAGAAAGGGCCGATAGAGAGAGGGGATATCTATGGGTTC 
GCGATTCTGAAAAGGACAAATCACTGGTGCTTTGAGAAGAGAGAGGGTGA 
GAAAGCAGGAAGGCTGGAGGCTGTCATCCAAGAGGCGGACATCTGTGAAC 
ATGATTCCAAGAGTCACCAGACCATGGGGGTGGCCAAAGGGAGTGCCTCT 
TCTCACCTCCTACTCTTAATTCCTTGTACTCAAGATAATAAGTTCCCAGA 
AGAGAAGTACCCATATTTAATTCATCTGTGTCTTCCTAGCAGTACTAAAA 
ATATTATATGAAAGGTATCAAACCTTTGAGAATGTGTGCTGCTAAATTGT 
TAAGGATGCTGGAAAACTCAAGACGTCCCTGATGGTGAGCCTGAGTATGA
GCCTGTGGTGAGCCCAATGCAGGTCTCCATTCAGACAAAGGCCTCAGGGA 
ACGGATGAGACCTAGGGACAGAGATGCATGCTGGAGCAGCATTCCCCATC 
CCTACTGCAGCTCAGGCCAGCTGACTGCTTTATGAGTAAACGTTACCAGG 
GAACACTTTGCAGTCTTAACACACATGCCCACCTGTGACCACTGATCCCT 
GTTGGGTGACCACTGACATCAGAGATTCGATGGCAGCAATGAAGACAAGG 
CTATCCTCATTAGGAAGGAAAGGAAGGAGGAGGGAGGAGGGCAAACGAAT 
CTTTCCTGCTTGTCAACCACGTCCATCTCTGTTAGGTGATTTCCCATGTG 
TGACTTTGTTTATCTTTATAATAACTCTGAGAGGTAGGTCTTGATGTCCA 
CATTTTGAACATGAGGACATCCAGCCAGGAAGTTGAGTTCTGGGGACATA 

TCCACTGAGTGCCCCCTGCAATCCACCAGCCCATTTGTGAAGTGCATACT 
ATAGGTAAGTTGGCACAGGAGGAGTGGATGTGGGCGATTTTGTCACAGCT 
CTCCAGGAACTTACACACTGGTGAGGAGGGCCAGGTATGTTCCTGACCAG 
TCACAATCAAAGCAACCTCCTACTAATCAGGGAGGCTTGGTACCTGGGGA 
ATGCTATGTTGAAAGGTTCTTTTCTGGGTTTTAAAATGATGGGTCTATTT 
CCTTATTCTTAAGATTGCTTTTTTTCTGGCTAGAACTTAAAAGAAATTTT 
CAGTAAAATTTCCCTTCCCTGGCACAAAGTGAGCTTGAAATGAATTCCCA 
GGTGGCCTTGATACTTTAAAATATTGCCTCCTATAAAATCAACCTTTAGA 
AGAAGGAAGTCAAAGAACATGCTAGATTTCACAAAGGTTAATTCCTTGAA 
ATCCAGTTATCTACAGGACAATGTTGTCAAAGAAAAAATTATTTGGCCAG 
GCACGGCGGCTCATGCCTATAATCCCAGCACTTTGGGAGGCTGAGGCAGG 
TGGATCACCTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAA 
ACCCCATCTCTACTAAAAATACAAAAAAAATTAGCCAGGTGTGGTGGTGG 
GCACCTGTAATCCCAGCTACACGGGAGGCTGAGGCAGGAGAATCGCTTGA 
ACCCGGGAGGAGGAAGTTGCAGTGAGCCAAGTTCAAGCCACTGCACCCCA 
GCCTGGGCAACAGAGCAAGACTTTGTCTCCAAAAAAAAAAAAAATTCAAT 
GATATTTTTAAATTCATGGTAAGGAAGATTTCATTCAGAACCAGCACAGA 
AGATATAGGAAACACTGCAATGGGACTTTGCGGTGGGGGAGAGAGATTGA 
ACACAACTACATATACAGCACGGGCAAGGACATATTCATAGCCAGGAAGC 
AGAGCAAAGATCAGTGGATGCGAAATTACTAAGAGGAAACATGAAAAATA 
AGGGAGCTTCTGCCTAAACCCACCTAACCGGATCCTTGCTGAAGACAGGA 
CAGGGTGATTGGACACCACTTTGGGGATGGTGGAGGATGGGGAATCCAGT 
GAGATTTCAAGGGTGATCAGATATTGAACATAGAAGGTTCTTGCTAAAAA 
AGGAGTTTACAAGAAAGTGTACAAATGTGCCTGGGAGAAGGTTCAGGAGC 
CTGACTAAAATTTGGTCAAGCAGAGAATATTTGCCAAGATAATAGCTAAG 
TCTTCTGACAAACAATAGATGCTAAGCCAGCAAGGGTGATGTGCTCAGAG 
AAAGCACTGAGGGCTTATTTCCTTTTCCCCCAATCTCCACTCAGTCAAGT 
CTAGTCCCCTTGTCAATGTAGCCATTTGTAAGAATGCAATCAGGCAGGGT 
CCCATCTCCTAGTGACAGGACTGACTGAAGTTCTGCTGAAGAGAGTGGCC
TGGGGCTGACACCGAGATTTCAGAGTCCTGGGTTTCGCCGAGAGCTCAGT
GTAGTGCCATGCCCTCTLTCCACCTGAACGCCCAGTGTGGG(^GGAACAA

CTGCAGCTAGAAGTCTGGCACTTACGCTGGGGTCTAAGACCTGCCTGATC 

TGCTAACTAGTCTTGTCCCTTGGCTATAAACTGACGTTGGCACCTGGCCA 

GAAAGATGAGCAAGAGATCTCTGACACACCTTTAAGTCCCTGTGGAGTAG 

GATTATGTTGGGGAAGGTCATTCTCTTGACTGAGCAGCAATTTCAGAAGG 

AAGTCCCATGCCGAAGTGAGAGAAGGCAGGGAATCCTGCCTAGTCAGCTA 

GAJSCAAAACAGTCTGCAGGACGGGACCCAGGGATGTGATCCTCCCATCCA 

AAGGCACTGAACTAAATGACTAAAATACTTTCCAGGGCTCACGTTCTTTG 

AAGAATGGGGACTAAAACTAAGACAGGAGCCAGCAAGTGAGGACTTGGAA 

GGAGATGGCTCATCTGATCAGCCTCCACTCAACAATTTTAATCATCCACA 

CTGGCATGGGGACACAATATGAATAAGTTGACAGGGACCTACTCTGATTA 

AGCAGTGGGCTAGTGCAGAGACCTGTCAGTCAAGAGTGGACAGGAGATGA 

TTTCAGACAGTGAGAACAAAATTAACAGAGTCATGTGCTAAAGGGTGGCT 

GGAACTACAGAGGAGTTTAAGACTCAAGAGGTCTGGCTGGGCGCGGTGGC 

TCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACAA 

GGTGAGGAGATCAAGACCATCCTGGCTAACGCAGTGAAACCGCATCTCTA 

CTAAAAATACAAAATATTAGCCAGGCGTGGTGGCGGGCACCTGTAGTCCC 

AGCTACTCGGGAGGCTGAGGCAAGAGAATGGCGTGAACCCGGGAGGCAGA 

GCTTGCAGTGAGCCAAGATTGCGCCACTGCCCTCCAGCCTGGGCGACAGA 

GCGAGACTCCGTCTG^AAAAAAAAAAAAAGACTTGAGGGAGTTGTTTATT

TTTGTTTTCTTTTTAAGACAGGGTCTTTGTTGGGCGCGGTAGCTCACGCC 

TGTAGTCCCAGCACTTTGGAAGGCTGAGGTGGAAAGATCTCTTGAGCCCA 

GGAGTTTGAGGCCACTCTGGGCAACATAGCAAGACACCGTCTCTACAAAA

AATGTGCAGGTTGAGGCTGCAGTGAGCAGAAAAACACCGCTGCACTCTAG 

CCTGGATGACAGAGCGAGACCCTGTCTCGGAAAAAAAAAGAAAAAAGACA 

GGGTCTCGCTGTGTCACACAGGCTGGAATGCAATGGTGCAATCATGGTTC 

ACTACAGCCTGGAACTCCTGAGCTCAAGCAATTCTCCTACCTTGGCCTAC 

CAAAGTTCTAGGACTACAGGTGTGAGCCACCACACGTGGCCTCAGGAGAG 

ATCTTAATAATAAAAGGACAAATTGCCTTGCATCCCTTAGGGGCAGGATT 

GACACATCCAAGGATCAGGCAGAAAGCCTGTGCGGAGTGGGATGAGCAAA 

GAGAAAGGCTGAGAGTTGTGAAGAGGGAGATGCAGTGCCAGCTAGGACAG 

GCCTTTTTGGGCTATGGGAGGTTTTCAGAGGAGACCCCACCTAAACTAAC 

CCATAACATTGCAGTGGGGACCTGTTGAAGTCATGGACTACTACCTGAAA 

GCCAGAGAAATGGGAGGAGCCTTTCCTCTGAGGAGGGACTCTAGTCCATA 

GGTATCTTGCCACCAAATACATGGACAGGCCCTGGGGGAAGATGGTGGTA 

GCCCAGCTGGAGGAAAACCATTTGCCACCTGAACTAGCCCAGGGTAAGCC 

ACCCAGGCACTGAGGGTGCACACCCATGCATGCACACACAGAATCACACT 

CCTTCCTATTATTCCTCAATTCAGGGGTCTCAACACCCATTTTTTTTGTT

TTTGGGGTTTTTTTTACATGTTTACATTTTATTTATTTATTATTTTGTGA 

CAGGGTCCCACTCTGTTGCCCAGGCTGGAGCACAGTGCAGTCGTGCAATC 

ATATTAGATTGGTGCAAAAGTAATCACGGTTTTTGTCATTAAAAGTTTTG 

CCATTACTTTTAATGATAAAAACCACGATTACTTTTGACGCAACTTAAAA 

GCTCACTGCAGCCTCAAAATTCCTGGTCTCAGGGAATCCTCCTGCCTCAG 

CTTCCTGAATAGCTGGGACTACAGGCACATGCAATCCTACCTGGCTAATT 

TTTTAAAAATTTTTTTTGTAAAGATAGAAAGTCATTTTGTTGTCCAGGCT 

GGTTTCAAACTCTTGTCTTTGTGCCTCCCTCTGCCCTGTGCAAGACCTTC 

TGGATGCCCACTAATGAAGACTTCCAGGGAGAGGAAAAGTAAACATAGGT

CCCTGATCAAGGGACCAGGGTTTATCGACCACAAACAGCATGCCCAGATT 

CCACTGGCAGTCCTAGAGGTCGCATTTGCCCCAAGTGTGTGTGGAAGGCC 

TCTCCCTAGCAGTTGGTTTATACACCAGCCACAGCACAGCATATTCTCTT 

AAATTGTGAACATTTGCAAAAACTCCTTGAGGACAACTATCATGTCTTGT 

GTACTTTTGTTTTGTTTCCCTTCCCCTATGTACACGCGCGCGCATGCACT 

CATGCACGCACGCGCGCGCGCACACACACACACACACCCCTCAAACTGAA 

TGCCTGGTGTGCTGAATGGATGAATGGCTAATGTAAGTCATTCTAAAAGC 

TACTTTCTTTGGCATACCATCACCTTTGATTTCATCTTTCTGGAACTCCT 

ATGTTCCCAGATGAATTTGGAAAGCCCTCAGGAAACATTTCAAAATTGCT 

ATATGGGAGAAATGGGAGGGTCTCTCTAGAAATTTACCTGCCACAGGTAT 

TTCTGGTAAGACACAGCAAAGGTGGCACCACCCATTCCTCGTTACAATGT

CAATGCCAGTCACCTTCCTGTCCCATAAAACTTTATTAAAGGTGCAGAAT 

TCCCATGGAAGCAGGTGGACACCATCTGCTTCCAGCCAGCCAGGGGAGCA 
AGG7G7CCAC7G7GCC1T7G7GGCAGGAAC7GCGC77C7C7AC7C7CCCA

C777GAGGCC7C7GGGGC7GGCC7GC7GCC7CC7CA77GACAAGGC7GC7 

7AC7GAGCAG77CA77C7GAGC7GGACA7AG7GC77C7GG7GAG7C7C7A 

CTTCTATTTAACCCAAAGATATTCTTTCCTAAGGAAACGCTTTCCTGTCG

GGGGAGGTTAGCTCCAGATGGAAGTCACAAGTGATGGCATGGTAGCTCTC 

ATCCGTTTGGGTGGATGATATTCACGGAGCACCACCATGAGCCAGTCATG 

GAGGTGAACAGTATATGCCAGCCCTGAATCAGGTGCATTGACAGCAAGGG 

AGACAAGCAAACAAAGCTGAGGTTTGCTGAGGATGTTCAAGACTCACACA 

GCACAGAGGAGCATCCACCACCCAGCTTGGGAAAGGACTTGTTATAGAGG 

GGGTGAAGCATGAGCTGAGTCTTGAAAGACTAGAAATTAGCCAAACTACA 

AGGAGGAGAAGGAGTTTCCAGTCAGGAAGAACAGGTTATGCAAAAGCACA 

GAGACTAGAAAGAATATCACATTCAAGGAACTGCAAATAGACAGGAAAGA 

TTGATGCGTGGGATAGGAGAGGAGGGCAGGGGATTCCAGGTGGGCCCTGC 

TTGCCACACTCAGGAGCTTGAACTTATCCACAAAGGAGGTGTGGAACCAG 

TAATGAATGGGTTTTGTGCAAGGGCTTCATGTCACCAGATTTGCTTTTTG 

GAGATACTTCTGTGGCTGATATGTGAGGAAGGGATGGAGGAAGTTTCCGT 

GGCAATCAGGAAAACCAATTAGCAGATGATTCAAATGGCCTAGGGGAAAA 

GGGAGGAGGACTTGGACTACCATGCAGCAGCAGAAATGGAGAGAAATAAC 

AGATCCCAGGCACTCAGGAAGCGCTCAGAATGAGCCCTTCAAAGAACTTA 

TGGTAGGTGATGGATGGATGGAGTGTGAGTCCTGGGATAGCATTGCCTGG 

GAAAATACTTTCTAGTTGAGACAGGGAAGTGGGCCAGCAGAAATCGAGGG 

CTTCTTCTTTTTGCTTTAAATACTTTTATAATATTTGGAACTTTGAAAAT 

GAGCAGATATATTAGCAAAAAGCCTAAAAGGGATATTTTTGAAATCACTG 

CTAGTTCTAACATATAACTTTCAGCTTGCACACATCATCAATTAACTTTG

ATAGCGCCTTTCTGAAACTATCATCCCAAATAGCAATCCTTGTAAAAACC 

TATTTTGAAAAACGGGCCTTGTAGGATAGCCTCACAGATGTTTTGTGGTA 

GATTTTCTAACATTCTAATGTCAGGGAGTGAAAGGAATCCCGTTAGAAGT 

TGGAAAATTCTGGAATCTCTATTCATGGTATTAAAGTTTTGCCGTCACAC 

AAAAGTTTAACACCTTTACACAATCAGACTTCCTCATTTTACATTGCTCG 

GTAATTAGAGGAAATCAGTCACCCAGAGCCTGGGTCCTAGACTTGACAAA 

ATGCACCCAACAAATCCTGAGTGGCCTTGCTGAGGACTTCTCCCAGAAGA 

TAGAAAACTCAGTTCCAGCCAACAAGGGGGAAGCAGCTGAAGAAGTGAAA 

TTAACAAAGTCCTGGAAGGAAATGACCAAATCATCTTTGATTGTGTAATA 

ACCAGAGAGTAGAATACAGCTACGACAGACATTTTGGGAGAGAAGCATTT 

TATCATAGCTTTTAGAAGAGAATATTTTTCAGCATCATAAGCACACAATT 

CCAAGACAGATACTTTCAAGGGATTGTTTTGACG 

>Contig53 

ATGTTNNGGTTTTGGGACCCCATTCAAACTTCATGTTGAATTTTAATCTT 
CAATGTTGAGCGAGGTCCTGTGGGAGGGTGATTGGATCATGGGGGTGGGT 
TCTCCCTTGCTGTTCTCAATGATAGTGAGTGAGTTCTCACAAGACCTGGT
TATTTGAAAGTGTGTAGCACCTCTCCCCTTCATTCTCTCACTCGTCACTG 
CTCCGCCATAGTAAGATGTGTGTGTTTCCCCTTTGCCTTCCGCCATGATT 
GTAAGTTTCCTGAAGCCTCCCAGCTATGCTTCCTGTACAGCCTGTAGAAC 
TGTGAATCAGTTAGACCTCTTTTCTTCATAAATTACCCAGTCTCAGGTCA 
TTCTTTATAGCAGTGTGAGAGTGGATGAATATAGTGCCATATGTTTGTAT 
TCCCAGCTACCCAGGAGGCTGAGGTAAGAGGATTGCTTGAGCCTGGGAGT 
TTAAGGCTGCAGTGAGCCATGACTGTACCACTGCTCTCCAGCCTGGGTGA 
CAGCGAGACCTTGTTTCCAAAAAAAAAAAACCCAAACTGTGTAAAATGTG
TTCATAAAAGTGTCTTGCTCCCACACCTGTCCCTATATATCTTATTCCTC 
AGCCTCCGACAACTACTTTATTCATTTCTTATGTATCTTCCAGAATCAAA 
AAAAAAAAATCAAATACAAGCACAGTGGAATGTATTGCCCTTCTTCCCCT 
CCCTTTTGTTACATCAGAGTTAGCATATCATAAATACGGTCTGCATTTTC 
TTCTTTTTCAGCTATCAGCATGTTTTGGAGAGGATTTCATATTCGTGCAG 
ACAGCATGTATTAGTCAGTCCTTGCATTGCTATAAGGAAATACCTGAGAC 
TGCATAATTTATAAAGAAAAGAGGTTTAATTGGCTCACAGCTTCGCAGGC 
TGTTCCACAGGAAGCATGGCAGCATCTGCTTCTGGGGAGGCCTTAGGAAG
CTTTTACTCATGCAGAAGACAAAGCGGGAGTGGATGTCTTATATGGCAGG
AGCAGGACTGAGAGAGAGAGAGAGAGAGAGAAAGGATGCCACATACTTTT
AAACAACCAGATCTTGTGGGAACTCTGTCACGAGAACAGCACCAAAGGGA
TAGTGCTAAACCATTCATAAGAACTCCACCCCCATGATCCAATCACCCCA



FIG. 4 (46 of 61) 






CACCAGGCCCC^CCTCCAACATCGG 

GGCTGGGACACAGAACCAAACAATACCAGAGTGCTTTCTCATTCTTTTCT 

ATAGCTGCCTAGTATTCTATGTCCTTTACTTCATTTAGGCAGTCTCTTGT 

TGATAGACACTTGGGTTACTTCCAATTTTTCCTATTACAAATGATGTGCA 

ATGAATAATTTTGATCATTTTCCATTTCACATGGGTTATGTCCATCTGTG 

GGATAAATCTCCAGGAGTGAAATTGCTGGATCAAAGGGGAAGTGCACTTG 

TGATTTTCATAGTTAGCAAATTTTGTTCTATAAGGGTCATATCAATTTAT 

AGTCCCACGCGTAATATTTAACAGTGGGGATTTCCCGACAGTTTGACCAA 

CAAGGTCTGTTGTTAAACTTTTGATTTTTGTCAATCTGATGGGAAAATAC 

TAGTATCTCAAAGTGCTTTTAATTTGACTTTCTTATTACAATGTTAAGCA 

TCATTTTACTCTGCCCAAGATCAAATAGTATTTTCTTTTCTGTGAACAGA 

CTGTTAAGATCCCTTGCCTCTTGTTTTGCTGGATTTTTGTTCTTTTTTTT

CAAATGTTTTGAGGCAGTTCTTTACATGTGAAACAAGTTATCTCTTTATC 

TGGGGTGTGAGTTACAACTACTTTTCCTCTGGCTTGTTTTGCGCTTTGAC 

TTTGCTTCTGGTGATTCCCGCAATTCTGAAAGTGTACTTTTTGCATCATT 

CATTCTTATACACCCATGCTCTTGTTCACGCTGGTTCCTCTA€CTGAGGG 

CTTTTTCTTTTCTTTTCTATCTGGGAACATTTTTTAGAGACAGGGTCTCA 

CTCTGTCATCCACGCTGGAGTGCAATGGTGCGATCACAGCTCACTGCAGT 

CTTGAACTTCTGGGCTCAAGCAATCCTCCAGTGTCAGCTTCCCAAGTAGC 

TAGLGACTACAGGTGCATGCCAGCATGCCTGGCTGATTGTTTTATTTATTT 

ATTTATTTTTTGTAGAGATGGGAGTCTCACTATGTTGCCCAGGCTGGTCT 

TGAACTCCTGGGCTCAAGCGATCTTTCTGCCCCTGCCACCCAAAGTGCTG 

GGATTACAGGCGTAAGCCACCATGCCCAGCCCATGTGTGGAAATCTTCTG 

TTTATCCCTTTAGGCTTGATTCTTATGTCGTTCTCCTCCCTCCTTCCTGG 

CTACTCCTCTTGTTCTTTATCTTACTCTACTTGTCATGTTACCTTGTTTC 

TGCTTATAACTAGCTGCCTCTCCTATCTGAGGAGGGACTTGTGACTGTTC 

TCATCTCTGTACTCCCAGGTCCTAGTACATAGCGCTTGCTCAACAGATGT 

TTGGTGCATTGATAGATAAATCAATGGTAGCTGTTAATACCAGTCCTGAC 

TCCCTGCAGTGCTTCAGCTGATCCTGTTCCAGATGTGCACTGAATATCTT 

TCTGTTGAACAACAGAAATAAAGGGGATGGGTGAGGAGGATAGTCTTCGG 

TGGCCAAGGATATTTGTAGGTACTTTGCAGCACTCAGCAATGAGGAGTGG 

GCTTTAGTCCCCCAAGAACTCTCACAGCCCTGTTTGTCTTTACTGTTCAG 

TGTCAAATCCAAGACAAGTCAATGATCAGGAAAGACCTTTTTTTTTCTTC 

AGTGAAGTTTATTTCAGAACCATTGAACAGTATGATATTTGCTCATTTAT 

AAATATTCCCATTTAAATAATCTGAGCTTATATATTTTCAGTCTTAATTA

AAGGACTTGATTTAAAGAGAGCACACCAGTCCAAATTGAATTGATTCCAT 

AGCTATTAAAAACTAGGCTCTTTTACAGACACTGCTACTTCTTGCCCCCT 

TTGAATAAATTAGACCAATGAATAAAACAAACAAACAAATAAATAAATAA 

ATAGGGAAGCGGTTGCTCATCAGAATGTGGGAGCGAATGACAGAGGGTTT 

CTTAGAACCAAATGTGGCCGTGGTTTCTGTCAGGCGGGCTTTAAGTGAGT 

AGGAGAGGTGAGAGAGGCCTGGCTCAACAAAAGGGCTGGGGATTGGCCCT 

GAAAGGAGAGAGCTGACTGTCCTGGCTGATGGACAGGAGATCCTCTTAGC 

ACTACCCTAAGGCAGGCAGTTGGGCATTGGTGTAGACAACAGGAAAGTCC 

AGGCTATAGCCGTACTCAAAAACCTTTCTGTTCCCTTTCTGCCAGCCCTA 

GGGATTGAGTCCACATTCAGCACAGGACTCTCTGGGTACAGCTCTCTTTA 

GGAAGACACAAATTGCATGGTGAAGTCAGTTATATCCTGGCCGCCTTTGG 

TCCCTCCCAGGAAGACGGGCATGTTTTCTGCTTGAGAGGTGCTGATGTAC 

CAGTTGGGGAACTGGGCAGACTCAAATTCCAGCTTGTTATTGATTTCTAT 

CTTGTTGAAGACAAATCGCTTTTCCATCTTCTTCTTTGGGTAATTTTTGG 

GATCTACACTCTGCAGCGAAAGAGAAAGAAGAATTTTTGTGGGGCAAGGG 

ACAAAAATGCTATGGGAAAGATGTTCTTTGGGTTGGCCAGAAAGGAAACT 

GACGAGCAGGTCACATGATCAGGAGCCACACTCCTGAGTTGTAACTGGGC 

CCCCAACTTTCTGTGTGATTATTAAAAGAGCCCTTCTTCTTTTCTAAAAC 

GGGGGGGTGAAAATTAAGCTAGAGCTTCTTGAAGTACCTAGTTTCCAGGG 
GCTTTTTATTGTATTTTTCCTTATGGTCCTAGAATGACATCAACTTGGAA 
ATGAAGCTTTTGCTGAGAAAGCTGGAGGTGATAGTGGTGGTGATTTTGGG 
AGTGGAGTGGACGTGATAATGGGACCCTTTAAGTCATCTATTTCCCAAGG 
TGTCTATCAAATGAGAGCAGCCCTAACAATATATAATCTGTTGGGGTTGT 
AACTATGGTAGGACATAATAACATCGGCAAAATGATTTAATTTTCTGCAG 
CAGGA77GAAGG77GCAlGCAG77AAAA~ rTATGTTAAATTTATTTACAT '

TAATGCAAAATTGTCAAATAGACCTGTTCCCAGCTTTTCCTAGGGATGGG 

GGCGGGGAGAAGGTGGTTGTCTGGGAATAAGTGGTAGCAGGAGGCTGAGA

AGGGCTTCATTCCATAGCATTCACTTACCTCCAGCTGTAGAGTGGGCTTA

TCATCTTTCAACACGCAGGACAGGTACAGATTCTTTTCCTTGAGGCCCAA

GGCCACAGGTATTTTGTCATTACTTTCTTCTCCTTGTACAAAGGACATGG

AGMCACCAC7GAAGAAAGAAGGGGGTCTTGTGGTTAGGGACACAGCAGT 

GCAGGG7CACCCCAACCCC7AGGCCCCA7GAG7AGGA7ACA7G7AA77TG 

G7AGCC7C7G7GGGAACCCACAG7GAGG77CC77GGCC7AAGACACAGGA 

TAAC77GAC77C7CACAGACAA7AGCAGGG7CA7777G77GA777AGGG7 

77CCCC7CAAAGGCC7GAGGG777C7CAGAGCC7CA7AGCAG7AGGAACG 

GAGAA7GAAAGAGGG7C7ACA7777AAA7GC7GAAGGAAGGAAGGAAGGA 

AGCCA77G7G7CAC7GGC7GGCAA7G7GCCCA7CCACAGGAGCGGAACAA 

C77GA7CAA7G7GGAAGGAAAGGAAAGAGG7GAGGC7GTAC77C7GCCAG 

AAA7CAGGCACCAGAAC7G777CAGGAACAGAGAG7AGCCCA7GGGAAGA 

AAC7GGGAGAGGAGAGGC7GAGC7GGGAAAG7GGC7CCAAAGAGAGACAC 

7CA7777GA7C77CCTCAG7CACAGCAG7G7CAA77GGAAGGCCC7GGGA 

7CAC7C77AC7ACCCGA77CCAAAGAAACAGGA7777C77GGCC7GGC7G 

AGAGCAAA7AGC77CCCCC77GAG7GAGGC7G7CC77CAAAG7CAGCAGC 

C77AG77GCCCACACTCC7G7GCAGAGGC777GGC7AC7G7GGCACGA7G 

CCAGGCAGA7CACCACAGC7AA7GA7GGG77CACCGCAC77GAAAC7777 

GCCCG77ACAGCGGAGAGA7A7AAG77CC7GC7GGGCGG7AAAA777CCC 

7ACAAGGAACCACC7GGCA77GGG7GGGACGGA7G77GGGGCAAGGGGGG 

AAGAC7GGGGAGGGGGA7GGACACA77A7CGC7CCAGCAC7C77G777CA 

GCC7CAACAACAGGAAGAGAGAACCCACAGGCAG77AGGCCA7G7CCA7C 

AAA7GACCCCA7A77G7GGAAGAA77GACA77GCAC7ATGCCCAAGAGAC 

77GGG7GGACA7GG7CC7GGGAG7GC77GAGCCGTC7AA777C7CAGGG7 

CACAC7CC7G77AACAAA7GCAC7GGCCAG7GCAA7CAAA7G7GCCA777 

C7AGGACCAAAG777G7A7A77CC77777AA7A77777777CAC77G7G7 

7GA7CA777GCC77AAA77AAC777C7AC777G777AAAACA7GGAGAA7 

7AGCAAGC7GCCAGGAAGCCAGGCAGGGAAACCAGGA7G777CCA777AC 

C77G77GC7CCA7A7CC7G7CCC7GGAGG7GGAGAGC777CAG77CA7A7 

GGACCAGACA7CACCAAGC7TT777GC7G7GAG7CCCGGAGCG7GCAGT7 

CAG7GA7CG7ACAGG7GCA7CG7GCACA7AAGCC7CG77A7CCCA7G7G7 

CGAAGAAGA7AGG77C7GAAA7G7GGAGCACA7G77G777AGG7A7AAAA 

7CAGAAGGGCAGGCC7CG7GAGGCAAGG7GGCAAAA777GA777C77GGA 

GGACACC7GAGCA7A7ACGG7CAAAG7C7GA7GACAACACCAG7AGGGA7 

GAAGC7GGGAGTGGGGTGGCTAAGAACACTGGACCTGACAC7ATTAGACA 

7GGG77CCAGC77CAGG7C7A77AC7GC7CAC7G7GGCCGAGCAACAGAG 

C7AC77AGG7AAAA7GG7GA7GG7CA7AACAC7AGCCCACAGGGAGG77A 

CGAACC7C7GG7GACAA7GTAAG7GAAAGGCCCC7GAGAAAGAG7GAGGG 

AG77GCAAA7G7CAG7AGCCA7CAAGA7C77C777AAGAA7AGTT7CCAC 

7AAAGAGA7GA77GC777GG777CCAGCC77C777G77T7G7C7CCCCGC 

7GGGCC77C7ACC777AAAGGGC777GGC7C7GGGGGAA77GAG77GGC7 

GGGGC77GA7GAC77CCAAGAGGACACAAGTGGAGA7C7AC7GCC7GCTC 

77GGC7AAC7ACC77C77CAAAGA7GAAGGGAAAGAAGG7GC7CAGG7CA 

77C7CC7GGAAGG7C7G7GGGCAGGGAACCAGCA7C77CC7CAGC77G7C 

CA7GGCCACAACAAC7GACGCGGCC7GCC7GAAGCCC77GC7G7AG7GG7 

GG7CGGAGA77CG7AGC7GGA7GCCGCCA7CCAGAGGGCAGAGG7CCAGG 

7CC7GGAAGGAGCAC7GCGGAGAGAGCGAGGGAGGGAGCC7GG7GAGG7G 

G7CC7GCCAGGAACCA7GC777GACA7CAGAGAG7AGAAAGC7CAGAGAG 

GAGGAAAGGGC77GAAAGAA7CCCGAGC77C7AAAGA7CA7CCC7C7C7G 

GGCCAGGCG7GG7GGC7CA7GCC7G7AA7CCCAGCAC777GGGAAGCCGA 

GG7GGA7GAA7CA777AGG7CAGGAC77CAAAACCAGCC7GGCCAACA7G 

GCGAAACCCC77C7C7AC7AAAAA7ACAAAAA77AGC7GGG7G7GG7GGG 

S7GCACC7G7AA7CC7AGC7AT7CAGGAGAC7GAGGAAGGAGAA7CGC77 

GAACTCAGGAGGTGGAGGATGCAGTAAGCCAAGATTGTACCACTGCACTC 

CAGCC7GGGCAACAGAG7GAGAC7C7G7C7CA7AAAACAAAACAAAACAA 

AACAAAACAAAA7AAAA7AAAA7AAAA7AAAAAGA77A7CCC7C7C7GAA 

GC7CAAGGAGG77AAGGG7G7AC7CAAGGGCACACAGCAGG77AGAGGCA 
GAC7CAAGAC7AGAA7GTGGGC777C7Ca/*CACC77ACAGGC7A77C7777'"'

AGAATAAATCCCATTTCTACTTTGTTCATCTTTTTTGTACATGCCCCACC

TACACCATACATGTATACCTTCTCTATATCTTTTTGTATCCCTAATGCTG 

TCACACTATGATTTGCTTTTTCATGCAGATGACCATAACATTTTCCATTC 

ACCTATGCTCACTCAGCAAGTATTCAATTTTTCTACACTGTTCTTTTTTT 

TCCTTTTTCATAACACTGTCTCATAGGCATTCTGCAAATCCTGTGAGAGT 

ACJTTTTGTGAAATGTTACCACTTTCCTCTTATTCAGAGAAGCTCCGTAT 

TAAGGCTTCACTGAGGTTGCCTTAAGGCATGATAATGGTTCAAAGGCTTG 

AAAGACAGTTAAAGAGACCTGTAAGTGCACAAAAGAAAGTTGAGCAGGAG 

AGAATTTCTTGCCTGGAGCAGAGCCAAGCTACTGGAAGAGGCAATGGGGG

CAAAGGCCAGGCAGACAAGCCAATGGGCTCCTCCCACAGCTGCAGCCAAC 

AAGTTATGCCAGTCTTAAAACTTCTAAAGAAATATGTTTTTAACAAGATT 

GAGGACTGGATTATGAGGCTAGGGGAGGCTATCACAAACTGGAATAAAAT 

AAAGCCAGAGAAAAGTGGCTGCCTTCCAACCTGCACAACTGACCTAGCTA 

GGCTGATGGCTGGGCCACCTAGGAAGGCTACTGAGCATCATATAAAACAG 

AAGGGACAGCAGGAATATAACATGGCTCTTTGTAAGGATGAGTCTGAAAA 

ATGACCATTTGCTGCCCAAATGCCCTTAGCTACAACTGAAAATATTTCAG 

AACTGGAGGTTGCAGGATGCTGGAATCTCAGAGATCATCCAGCTCAGCCC 

TTTATTTTTCAGATGAGGTCCAAAGCGGGTAAAATGACTTGTCAAGGTCA 

AACAGCAAGTGAATGGTTTTCTTTCAAGTCTCAATTCATCTTTTTGTTTA 

TATCATCTATGTCTTGTTGTTATAAGCTTCACCCCAGGTAGCAAAAAACT 

ATTCTACTCAAAAGGGGTAGACATATGTTAGTTGTCAAGATCATCTCTTG 

GTTTCAGAGTTTAACTCAAGTGATTGGCATAGGCTGAATCCATCTCTTAA

AAGGATAATCAAATTTATGTTGAAGACTTGGTTGTCTTCCTACTATGAAA 

TGGGAAACATTATCACTACTCCTCCCCTGTCACCACCAAGTGTGGCCACC 

ACCACCAACGTTAGTGAGTGACTGTGGTGATATGATGACCAAGTGGCCAG 

GTCAGCAAGTGGTGCAGCCTGTGTCTCACTGGAAGAGGTTAAAGTCTTTC

TAAAACAAAATACCATGGCATCAAAGTGGCCCAGAACTCCCTTCTTTGAG 

CTTTCCCTGTGTTAGAGCCCTTCCTTGGGTTGGGAGTTAAACCCATAGTC 

TTACCTTCATCTGTTTAGGGCCATCAGCTTCAAAGAACAAGTCATCCTCA 

TTGCCACTGTAATAAAAACAGGGACATGTCTCAATTATGTCTTCTAAACA 

GGTTTATTTTTCCTTCCCTGTGTACAAGACTTGACTGTTCATAAGAAACT 

GCAAACAGCCTGCCTCTCAAAGCTGCCTGAAACACCTGGCAAGTTTCACA 

GTGATATGCGCAGAACAGTCCAGAAGGCAGATTCTAGGCCTGGCAGGTGG 

GCACCCTGGGTGCTCCCTGTTGGATCTTGAGGCCTAACCTCTAGCCCAGC 

AGAGTCAGCTAAAATCTGAGCTCTCCCTCTCCCTCCAAGCCACACTTTGC 

AAAGGGATTCCTTGTATTGTGGGCTTGGAATCTTTTCTCCCCATTTGCCT 

CTGCAGGAAGCCCTTGCAACAACACATCTGGATAGCCTCCAGGTCCCAAG 

GCTGGAGGGACTTGTAATGGGAAAGTAGTCTTTAAATCAGATTTACTTGG 

CACCCTGTTTGCCACTGAAAGAGGCAATTTAGGGGAAAAATCTGGTCTCC

AAGCACAGATAACACTCTACTCTTGAAAGAGGAGACCTGCTCATGTTACT 

GGTCTCAGCGTCTCCACTGACCTGTAATAAGCCATCATTTCACTGGCGAG 

CTCAGGTACTTCTGCCATGGCTGCTTCAGACACCTGTGTAAAAAGGAGAA 

AATGAGTGACTTCCCCATGACGGCTACGTTCATGTGTGATTTCTCTCAGC 

ATCCAGTGCATGGCAGTCATGCAAAGAAATGATCTCTGAGTAAATGAATG 

AATGTGTGAAAGAGAAGTCCTTTGGGTCTAGAGAAAAGCATTTGCTAAAC 

CAAACCCCAACTAGCAATGTATTGGCTAGGAGAGCTGGAGCAGAGGCTTT 

GACACTAACCTTTAGGGTGTCAGCTGTTAGATAAGCAGTATCCATTCCCA 

GAATATTTCCCGAGTCATAAGCATTATATTACACCTGGCATTTTTGCAAA 

AAGCTGAGAGAGGGAGGCAGAGAGGGAAGGAGAGGGAGAGACAGAGAAAG 

AAAGAGAGAGAGAGAGAGAATATGCATACACACAAAGAGGCAGAGAGACA 

GAGAGACTCCCTTAGCACCTAGTTGTAAGGAAGATTAAAGTCATACTTGA 

GCAATGAAGATTGGCTGAAGAGAATCCCAGAGCAGCCTGTTGTGCCTTGT 

GCCTCGAAGAGGTTTGGTATCTGCCAGTTTCTCCCTCGCTGTTTTTATAG 

CTTTCAAAAGCAGAAGTAGGAGGCTGAGAAATTTCTCTGTTGAATACCTG 

ATTTCACAATCAAGTTAAAGGAAAGGGGAAAAGAGTATTGGTGGAAGCTT 

CTTAGGGGAGGGGACTAATAAACTGAGATAATTCTCTGGTTCATGGAAGG 

GCAAGGAGTAGCAAACTATGACACATTTTGCAAATGTATCACCATGCAAA 

TATGCATTGTTTTCCTGACAATCGTTGTGCAGTTGATGTCCACATTAAAA

TACTGGATTTTCCCACGTTAGAAGAATGTTTAAATTTAGTATATGTGGGA
C AAAGTGGAAGACACACAG ATTTATACA * GCACATACTTTTCTTCATTCA

CTTCTTTGTACTTAAGTTTAGGAATCTTCCCACTTACAGATGGATAAATG 

GGTACAATGAAGGGCCAATAGCCCTCCCTGTCTGTATTGAGGGTGTGGGT

CTCTACCTTGGGTGCTGTTCTCTGCCTCGGGAGCTCTCTGTCAATTGCAG 

GAGCCTCTGAGGAGAAAATTGACCTTTCTTGGCTGGGGCAGAGAACATAC 

GGTATGCAGGGTTCAGGCTCCTGACGGAGTTGGGGCAACCCTGGAGATAA 

GCTCACACAACCCTGCAAGACCAGGTGCTGTTACCCTAGCCAATCTCATG 

GATGAACCAGATCAATGCCAGATGAGCTCTGCCTAAAATGATTTTTTGGT

GAACTCTGAAAAGTGGAATATTGTTTCTGTAAGAATATCCATCTGAGACT 

CTATCTCTTGGTAATACCAAGAGTTATCAGTTTCTCTTTAACCGAGACAC 

CAGCAAAGTGCCTGCTCCAGGGTACTGCCCAGGGGAGCCCTCCATTTGTA 

GAATGAATGAGAGTCCAGGTTATGAACAGTGCCTGGAGTGTAGGAACACC 

CTCCTTTGCCTCTTTGACAGGTCTGCATCATAACACTTTTTTTTTTTTTT 

TGAGACAGAGTCTCACTCTGTCGCCCAGGCTGGAGTGCAGTGGCACGATC 

TCGGCCCCCTGGAAGTTCCGCCTCCCGGGTTCACACCATTCTCCTGCCTC 

AGCCTCCCCAGCAGCTGGGACTACAGGCACCTGCCGCCACGGCCGGCTAA 

TTTTTTGTATTTTTAGTAGAGACAGGGTTTCACCATGTTAGCCAGGATGG 

TCTCGATCTCCTGACCTTGTGATCTGCCCGCCTCGGCCTCCCAAAGTGTT 

GGGATTACAGGCGTGAGCCACCGTGTCCAGCCTGTAACACTTCTTATAGC 

ACTGAGTTGAAACCTTGCTCCTCCTGGTTCCTCCAGGAAACTGAAATCTT 

TTTGAGCCAAGTCTAGCACAGTGCCTGGCATGTACATTCAGGTGGTAGAG 

TTTGCTGCTTGAATGGGTGAATGGGAATTTGACAGCATTTTTATTCAAAT 

TAGTATGTGCCAGGTATCGTGCTCGCTCTGCATTATCCAAGGGAGTGAGC 

CTCTGTGCAAGTATTTGAGACACGAGGGAAATAGGTTCTACTGTGGGAAA 

AAGAGCATTTCATGGACTTGCTCTCCAAGCAGCCTTCTGATTTTTAATTT 

GGCTCCCAGTATCTTGATATCAGGAGTCAGTCACAAGAACTCCATCTTTA 

GTAAGTTATATTTTCCACAGGAAATCTAAAAGCTGTTCAACATGTTAGTT 

TCCTGTGAATTTGATAAGCCATAATCCATTCCTAACACTGAGCCCTCCTG 

AAATTTGGTGTCTGGTCCTGCAGATAGCTAAAAGCCCTGTCTGGGTGGCC 

TAGGGGACTCCTCTGTTTTGCCTCCACAGGATCCACTTTGCAAATTAACC 

ACTGGTTCTCCCGTTGTAGGAACTGCCACCTTCCTCAGAGCCTGTCTTTC 

TTCCTTCCTTCCTTCCTTCCTCTTTCTTTTTCTTTCTCTCTCTCTTTCTT 

TCTTTTCTTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTT 

TCTTTCTTTCTTCCTTTCTTTCTCTTTCTCTCTTTCTCTCTTTCTCTTTC 

TTTCTTTCTCTCTCCCTCCCTCCCTCTCTCTCTTTCTTTCTTTTTCTTTC 

TTTTCTCTTTTCTTTCTCTCTTTCTTTCTCCCTCCCTCTCTCTCTTTTTC 

TTTGTCTCTCCCTCCCTTCTCTCTCTCTTTCTCTTTCTCTCTCTCTCTCT 

CCTAGACAGGATCTACCTTTATCCCCCAGGCTGGAGTGCAGTGGTACAAT

ZATGCATTCATTGCATGATCACAGCAGCCTCAAACCCTTCCTCAGAGTCT

TTATGCGGCAACCAGCAGGGTCTGGAGGGTTGGTGGCTCTGTGAACTCTC 

CTGACAGAACACAGAGATGTCTTTGGTCTGTTGATGTGATTACAAGCTGA 

ACGAAGGAGGATCAAAGCCAGTGACAGGAAGGGAGATATGCAAGGGACCC 

GAGCATCAGCTCTGAGTTAGTCCATTCTGCTTCTGGGACTTGGGATACAG 

GTCAGAAACCTTGAGCTTCTACTTCTCCATCTTCCAATTGTAGCATCCAG 

GACCTCAGAATCTGCCAGCTAAGAGGAGCCGTAATGATTGTCTGGTGGGA 

TATGGTGGGACCACAGAGATGAAGACATGAATAGCTATTTGAATGTGAAC 

AGCAGACGAAGAAATCAAGGCTAGGAGGGTGGAAGTGACTCATCCAATAG 

CACAGTGTGGTTGAAGCAGCACTAGTATCCAGGTTGCATGAGCCCCTGAT 

GCTTTCGCTCGAGGGAAATTTTGGAGCCATGGGGCAATGCCCCCTGACGT 

AACAGTCTCCACAGTTCTGCCATGTCTCATCCTGGCCCTGTAACCTGGAC 

CCAAATCTGCTACCATCCCATCCATCTCAGGAAGTGAAACCTCTTATGTC 

AAATAGGTTGTGCAACGTATGTATCAGATCCTGTCTTCCCAAGGAGACCG 

CTCAGGCCACAGCACTTCCTTCCGATCCCCAATGAGCAGAAAATATCTCG 

CTATAAACATAGTTGGCACTAAGGGAGGGAGTGGAAGAGTGATGATGATG 

TAGATGGTGATGTAGCCCCAAGGAAGTGGAACAAGCAGAGATGGGGAGCT 

GGAAATGCCAGGATGCTCCAGCTTTTGGGGAATTATTCAGCTCTTGAGTC 

ACTAAAGCCTTTCTCAGCTGCAAGTTCCTCTTTACCCTGTCAGGTCATTC 

TTCCAAGACAGGAGACTGACATTTATTCAAAGCAGCAAGTGCCCTGATAC 

CATCTTGTGTCTAATCATGGGCTTCGCAGCCAGTTATCAAGGTTGATCTC 

ATCTCATTGGTCTTCAATCATTTTGAACAAGAAGACAAGCAAAATAATCA 
TGGGTTAGTTCTTATATTATTGTGTGTACATGCAGTGATGTCTGTTCTTT
GTAGTGAGCTGTTCCTTCCTTGTTCACCCTCTTGCTTAGAACAGAACTAA 
GCAATCTGCCCCCAACATTTTCCCCAATTTCCCATCTCATTCTTGGCACT 
GGCTTCCTAATATTTGTTCTTATGAGTCATTTTCTTGTATCATTTCCATG 
AGTCCCTCTGGGATCTTAAAGTATGAAAAATGTTGTGTGTACCCACACCT 
GTCTTTGTGGATATTTCTCTCCTTTCCCTTCTGCTTCTGGGATTATTTGG

ga^tgggcactatgatttttatcatatcgcttccacttcctttatggcat 

catctccaatgggcttcttctccctcttggatccaggttctcagattggg 

gacatgcagagtccaaggaacattccattctcctccctggtctagaacaa 

ggagggcttagatatatgagcaggtggctggggctggcgagctatgtagt 

ctccaatggcttttccctgatgtcggagttgttatgtcagttctgggaga 

ccaataagaccttgtccttcctttggatccatcagaaaaagcccctgggt 

gggtaagatggatggcagggctctcctactctatgtcttttctcacacct 

agtgggtataagagaggggaccacaaacagagggggctctggtaccactt 

atccagggtctggaaacattttctgtaaagggccagataataaatgtttc 

aggtacaactactcaaccttgcatcatttcagaaaagcagtcagataata 

cataaatgaatgggtgtggctggacttgtcctgcggtcccctgtcttata 

tcattgtattatatcattttttcttacatacaaatttagaagcaatactt 

aaaaaaaaaaagccgtcctttattgagcacctactaagtgccaggtacct 

ttttttccctcattatcttatjaactcttcataataacctttaaagtaga

taatattgaaccatttgacctatgcagaaactgaggttgagacaatXaat 

tatttaagaccgcacaaacagtaaatgctggaactacgactcaaatatgg

GTTAACTGAACCAAAACCAGATCTTTATTTCTCACTTTTAATTGTTAGAT 

ATGTTTATTGCCTCATCTCCTGTCCACATGGTGCCCATCGGCAGACTCCT 

TTCTCATTCTCAGTGATTGAGTGACATTCTAAACTACATTGGCCTGGCAG 

ATTCACCTCTGTCCCCTAAATGTTTCCACATTGTCCTTTTAGGATTGAGA 

TCCTCTCTGTTCCCTTGTCTTCCCTCCTTTCTTCTTCTGGCGGTGACGTG 

CTGTGTGAATTTGTTTCTTTCTCCTCTCAGGGTAGTACTGGGACTTTCCA 

AATCAGGGTTTTTAATGATCTCTCTTCNCTTTTCTGAATTTCTTCCTTAT 

TCCCATTCACTTTCTCATCTATAAGTGGCANCTTTGTTGCTGGAAGATAT 

CCCTTGTGCAGGGATTNCTCTTTAANAATTTGTCNNNACC 

>Contig54 

GTGATCGTCAACCTCCCACCCTGTAGGGCCTCAAGCATTGAGGACAATCA 
CTGGCTGCCCATTAACCCAGAAATGTTGCCGAGACAGGAGGCCGTGGCCC 
AAGTTCCTGGAATGGGGTATTATTATGTCAGCACAAAGGCCTTTGCACAA 
ATGAAGGCTTTAAAAATGCAGTCCTAGTCAGGTGGAGGAGGGCTTATAGG 
ATTCCCAGGAATCTGGATCATTCTCTTGAGAGCTTTCCCTTGTCTCTGTT 
AAAACTCACATCGTACGGCCCAAATAACAACAAAAAATGGATGTAAATTC 
TTGAAATAACTTGTGGATGGGGGAACAAGGCCCACCCCCCAGATCTGCCA 
GAAGCTTCAGGTGAGGGTCCCAAATGCCAAAAAGTCTGGTATCAGAGAGG 
ATGGCCAGTGACNTGGGGACACATGCCCTTTGCTGTGTCACTCAAGGAGC 
AGCAGCTTCGGCCCCGCACAGTGACCAGGACCCTGGCTTCCCACGCTGGG 
CAGGAGCTGGTGTCTGATGAAGGGAATGCCTGGCAGCACGTGCTGTCTGT 
CTCCTCGTGTCAGCTTACCTGGCTTTGCTGCGAAGAGGCCACTTGCATTT 
CTTTATTTTTTATATTTTTTTAATTTTTTAAATTTTTTATTTTATTTTTA 
TTTTTATTTATTTATTTATTTTTAATTTTTTTTTAATTTTTTAAATTATG
CTTTAAGTTTTAGGGTACATGTGCACATTGTGCAGGTTAGTTACATACGC 
ATACATGCGCCATGCTGGTGCGCTGCACCCACTAACTCGTCATCTAGCAT 
TAGGTATATCTCCCAGGTTAATCCCTCCCCCCCTCCCCCCACCCCACAAC 
AGTCCCCAGAATGTGATGTTCCCCTTCCTGTGTCCATGTGATCTCATTGA 
ATTTCTTTAAAGGTGGAATCTCTCAGTGGGGTCTAATCTGTTCAGAAATA 
TCAAAAGAGTATCCTTGGGAATGACTGGAATTCCAGAGTCATCTGGTAAT 
CCTCATAAAACAACTCCTGGATGTCTCTCAGCACATCTCCCACCTTGAAC 

TTTTTTGGCCTAAAGTGCAAAAGGGGATACGTTTCATGTAAATAAATCAA 
CTGCAAATCGCTAGTTATGCTGAGCCCTGTCCCGTGCTGTGGACACAAAG 
GAACCAAAGGCTTTTCTCCCCGCCCAACACACACATAACACACACACAAA
ATCATAAAAACATACATACCCCCAACACATAACAACACACAACACACACA 
CAAAATATATACACACAACACACACCAAACATGCCCACAAACCTGTGTCC 
AAAAATAAATCCTACTGGTGGGTTTGTGGTCTCCCTAACTTCAAAAATGA 
AGCCGTGGACCTTCGCAbTGAGTGTTACAGCTCTTAAAGATGGCATGGAT

CCAAAGAGTGAGCAGTAGCAACGTTTACTGTGAAGAGCAAAAGGACAAAG

CTTCCACAACCCAGAAGGGGACCCCAGCAGGGTTGCTGGTTGGGGTGGCC 

AGCTTTTACTTCCTTTTGGCCCCTCCCATGTTCTGTTTCCATCCTATCAG 

AGTGCCCTTTTTTCAATCCTCCCTGTGATTGGCTACTTTTAGAATCCTGC 

TGATTGGTGCATTTTACAGAGTGCTGATTGGTGCGTTTTACAATCCCCTT 

GTAAGACAGAAAAGTTCCTGATTGGTGTGTTTTACAATCCTCTTGTAAGA

CAGAAAAGTTCCCCAAGTCCCCACTGGACCCAGGAAGTCCACCTGGCCTC 

ACCTTTCAACTCCATAATGGCATGAAAATACATATGTTGTACAAAACATA

CATACACAAAGTATACATGCATCTCCCCAAATATACACATACCACAGAAA 

CATACACACAGGAACTCAGCTACCTGTCAAAAGTCTGCATGGTGATTGCC 

TCTGCAGTGAGTAGTTAGAAAAGTGAATTTGTTTTTCAATAAATTGGAGT 

CCTTAAAAATCGTTGTAAGATAGAAAATTTTTAAAAGTATATAAAATAAA 

ATATGTATGTCCTTTGGTCTAGCATTTACACATGTAGGAATTTATCCTAG

TGGAGTAATCAATGATATATGCAAAGATTTGGACAAGCATATTAAGCACA 

GAATTATGTATGCATATGTGTGTGTATATATATATATATCTGATACATAT

AATAATGTAAAAGTGAAAATAACTCAGATGTTCAAAATTGAGGATTAGTT 

AGACTATGATCTGTCCATATGTGACATACAAGTTAGCTGCCCCTTATTCT 

CTCGAGCTTCAACCTCCTATAAACAGTGTCCCTTGTATATCAGTATTGGT 

ACAGATAATCGAACTTATTGAGGTTTTACATGGGGCAATAAAGGCAAGAG 

TTTATGAATACTCCATACTACACTAGGTAGCACCCCCTATTAAAGACAAA 

CTCTTCTCTCTCATTTCCCTTCCTTTCCGGAACCACTTGGTTGAATCTCT 

ACAAGTCTCTATTGCAACTGCCTCAACATGGCACCCTCCCTGCATCTCCA

TCTTCCCTGTCCTGAGAGCAATGGCCTGCTGCCCCCACACTCACATCCTC 

ATTCATTCCAGAAGTGAGCACCACAGAAGTGCCTACAGTTACCCCAACCA 

CCTTCTTAGAAGATAAGTTAGTGTTTGTTTTGACTTTTTAAAATTTTTAC 

TTCCTCTTTTCCTTCACAATCTCATCCCATCCCAAGAGGTTTATCAAGAA 

GTTCTCTAAAGATATGTGTCTCCTTATGGAATTTAACAGAAATCAGGGAT 

TTGTATTCTAGCCATCAAGGGAATAACATTTTTCCAGGTCTTTAGACAAA 

TAATGGAATACCTTGCAGTAATTAGATACACTATTGTAGAAAAGTATTGA 

TGAAATGGAACGATGTTTGAGATATCATATTGAGTAGAAAAGGCAAGATA 

CATTAAGTAGGAAATGTATCTTACAAAATAATTTGTCAGACACACTCCTA 

TATTTGTATGTTATATAAATGCGTATGTGAAGAAAGGCTAGAGGATGAGA 

CCACAGTCTTCGGTGAAGTTTAAGAGATGAGGCTGCAGCATGCTCAGAAA 

GGCCTGGGTTATAGTTCTTCCAGTAATTAAGGATGTGATCTTGGGTAAAT

TGTCCATCCTCTCTAAACTGCACCACCTTTTGTCTGTAAAACAGGAAGGA 

TGGTATTTACCCCCAGGGTCATCAAAGGATTTGGTTGGAGAAAAATAAAT 

AAATGGGCTGAGCCCAGACCTGGCACAGTGAGAGCACAGTGGTTGACTAT 

TGTGCTGGCCTGTTGTTCCTGTGTTATTGACATGCTGCTGGTGGTGGTCC

AGAAGCTATTACCTTAATTGGTTATGTGGATTTCCCCTCATACTGAGCAG 

CTGTGTGTGGTGTTGTAAAACATAGCCATACACAGTAACTGACAAGGGCA 

AATGTGATGGAAAAATGCAAGGAAGTGCAGATAAATAGCTAATGGGCTGT 

AGAAGGAAGCTAGTCCTTGGAGGGCTTGATCAAGGAAGGTCCTTTTGCAT 

GTCACCTTTGAAGAAGAGGGGACATAGAAGAGGTATAGTGCATCCCGGAG 

TGTACCTGGAAGGGAACATGAAAAGAGGACATTTTTCTCTGGGACATGGG 

GACTCCACTTGCATGAACTCTGGAATTGGGGCAAAGAACCATCATGAGAA 

CAAGGGCTTCCTTGAACCTCCCAGGCTCATTGGCTGATCTAAACCCTGTG 

TCCCCTCTTTCCTTCACTCTCCTCTGTTTTCTATACCTGTATTATTGGAC 

TGGACTGGAAGCCACCTGATCTATCACAAGTACCTTGAAATGTGTTGAAT 

AGGTGTGGCACAGTCCTTAGCAGAGTGGCACTACCCCCACAGGAATTTGT 

TTATACCTTTGGCATGGAAAATAGCAGGAAATGAGTGATCACTGATAACT 

GAGGATGCTATTTATTATTGGCCAAAGGAATACTTGTGTTGTATTTGCAT 

AACCACTCACAAACTGTTGATTACAAATGAGTACCAGACCTAGCTCCTTC 

AAGTAAAGGATCCTGAGAACTGAAGGCAAACAGAGCTCCAGGAGTCCAAG 

ACAGAGCCACAGACCACGAGGATCCCTGGCCCAGGTAGGTGGTCCTCCTG 

CACTGGCTTTCAAGGCCAACAGGATGGATGGGGAAGTAGAGTAGCATCTG

GCCATCTAGACCCTTGCTTTTTATCCCCACTGGAAGCACATCTGAATTTC 

TAAATATGATCTCTGAGACCTGCCCAGAACACCTTGCTCTCAGCCCCAGT 

AGCAGCCTGCTCTCTCCCAGGAGGGCTTCCACTAACAAGTAGGGCATTGC 

TGGAGGGCCAGGCAGACACTAGCTTAGGAAATCCACCAACCCTGGAAATG
CTAGTCCCTTCTCTGAAGGCTCAGAAGACTGACTTTAGAGTCTAGAAAAT

ATTGGTCCTTGGGAACAGATTTTGAGTGCAAAGAGATGGACTTCAGATGG 

CCAGATGCACTGCTTCTTTAGGGAATTCTGTGAAAGCTCCCTGCATTTAT

CTTAATACAGGCAGCAGATTTCATGAGTACCCCCGAGGGATGGCCCCAGG 

TCCTCCAGCCTGTGAGCATCCTTCTGTCCTTCAGCAGCACCACAGTATCT 

TTATATGTCTTTGGATACCTACGTTTCTGCCAGACATCTCTTGCTCTGAT

GTTCTGGCTGCCAAATTCTCTGTCAAGCGCCTCCAATTTTTTGTGTCCTT

TGATTTACCCCAACATGACAAAGGCAGTTGTGCTTCATGTATTCAGGGAT 

ACTGCCAAACCACAAACAGGTTAAAATCAAATAGCAGATATCCCTGTTCC 

TAAAGACCCATCAGCTCTACCCACCTGCTCCTGCTCACCGTCCTTATTGT 

TGAGTCCTGAAGCCCTTCCTTGTCATTTTTATTTTTTGCATGAACAATTT 

AGTTCCCTTTGTCTCACTCCTAAACCTTTCTCAAAGGATTGGATTTGTAC

ACAAACTGCCTATCTCTGCAATCTTAGAAGTGATATGATTCTGAACAAAT

CACTTAACTTTTGATTTTTTATTGGTAAGATGGGAATACCAATTTTTGCT

CCACTTCTGTCCTATGTTGGCCTGGGCTGATGTTGAAAGCTCTCGGTCAA

CTGAGATAGGGTGTGCAGAATTTATATATATAAATATATCT€CTCCAACC

CCTCCCAATGAAGCAAGTCACGTGAGTCAATCCTACCCTAAGATATTAGG

GATTGAGCCTCCTGGGACATTTGGTGGCTTAGGTTTTCATGAAAAGAGGT

TGCAGAGCAACTGCTTTTTGTTAGGCAAAGATTAGGCTACTGCAGAGACT

CAGCAAACTTCTATAGAAGGTGTCAGATGGTAAGTATTTTAGGGTTTGCT

TGCCAGATGATCTCTCAACTAGTTAACCATGCTATTGTAGCCTCGAAGCA

GCCAGAGACAATATGTAAACAAGAGCATGGCTGTGTTTCAATAAAACTTT

ATTTAAAAAAACAGTCAGGGACCGGATTTGGCCAAAGGCCATAGTGTGCC

AGCCCCAAGACTAGAGCAATGCACTTTTAACTTTTTTATTTTATTTTTGT

AAAATGCCAAGATCCACAAAAATGCTATTGCACCCCGTGTGTTAGCACTG

TGACTCAAGGTTTGGGAAATTCTGCTTTGAAGGCGTGATAGACAGGAGAG

CATGGTCTGGCCCCTTGGTGCCTTTCTGGTTGCAGCGAGCATTTCAAACT

ACAGAGCAAGGCCAGTGGTCTGTTCAGCACTAGAGACATGCAGCAAGGTG

TCCTGGGGTGAGAAGATGCCATAACTGGTCCCCTTTCTATCTCCTTAGGT

CTTGGACTTCATTCCATTTTCTGTTGAGTAATAAACTCAACGTTGAAAAT

GTCCTTTGTGGGGGAGAACTCAGGAGTGAAAATGGGCTCTGAGGACTGGG

AAAAAGATGAACCCCAGTGCTGCTTAGAAGGTAAGGTTCTTGTAGAAATC

TACCTCAGGGCCAAAGTGTAATTCCTAGAGCAGAACTTTGCTAGGTGCTG

TGCACAGACCCAGTTGTTTCCTGCTGACTTGCACAGTAAGTGAGCTTTCA

AATTTCCCTGGACAAATAACTAGACAAGAGAAATTCTGGAAGAGAAAAGG

AAGCTTTGCTTCAGTGTCCAGGCACATCAGGTAGTAGATAAAAGGATCGT

CCTCACCTACAGATTTGGGGCTTTAGCATCCTGTTTGCCAACTGGATGGT

TGCATATGCTTCAAAATGCACCTCTTCCCTCCCAACATTCCCAAGTGGAA

GAGAAGCCTCCGATGAGAAGGAACTCTCTAAGGCTGGGCTGAACAAATGA

CCCAGGCACAGGGCATCTGAGTATTCCATGAGGAACACATTTGGGTGTTG

CCCATGGGGGACAATAGGAGGAGGCTTTTGACCCAAATGATTGTCTACTG

AGGTGTGACGGGAGAGGCCTGTGACATGCCAGAGGCCAAACCCGTGATCC

AGTTCATCTCTATTCTATGTTTCTGAAGAGGGAAGCTATGATTTAATGTC

ATTACTATCATGCTGCTCTAGTATTTCTCAGCACATACACAGAAGAGGGA

ATTAAATGGTCCTTGATACCCCTAAATCCTTGGAAAATCCGAATTGCATA

TGCTAACCTCACTGCGTCTGACTGCAGACCCGGCTGTAAGCCCCCTGGAA

CCAGGCCCAAGCCTCCCCGCCATGAATTTTGTTCACACAAGTAAGGCCTC

GGGGTGAGGTGATGGGGGTGGCTGAGGTGCGAGGGTGGGGATGGGGGATG

GAGCCATTGGGTCCTCTTACAGGGTGAGAGAATTGTAGAATGGGGACACC

TAAGGGTGCTGGATGGGGCTGAAGTCTTTCCTTTGTGGAAGCAAATCCCA

TTAGGAGATAACTCTGGGAAAGATGAGCCCGGGGAGGGGCAGGTGATGCT

CACCTGCTAAGAGGCAAAGGGCAAGGAAGAGTTTGTGCCTGGGAACCTTC

CAGGTGCCTCTTCTGACCATAGCCAAGAGACTGGAGACACAGACCTCCTC

CCAGCACTGAGGACAAACAGCCATGGGGCCAGTGGGGGTGCAGGGACACC

CACACCACTAAGGGCTCAGGGCGGCGCCTTCAGAGCCTGAACCTTCCTCT

CATGCTGCCATTTGAACACCACAACACCCTAATAGGAAACTGTTAACATT

GCCACTGTTCAGGTGTGGAAACCGAGACAGACAGTGGAGATTCCCTGCCC

TAGGTGACACAGGTAATAAGTGACAGATGTGGAAATTTAAAGGTACTATA

ACGTCTGTCTGCCTGACTCAGGCTTAAGGCTCCCATCACCTCCTCTTCTC

AGGACAGAGTCAGGAGGCCTCAGCCTGAGCCCCAGCTCTAGTGCAGGTTC
ATGTGGGAATACTGAGCu rCACTAGTALrxATGGCAGAGAGGACCAAATGCS"

GACCAGGTGTGTAAGGGTGCCTGGCACAGTTGGGGGAGGCTGCTGTCGCT

TCTCCACCGCTGCTGCTGCAGTTACCTTTGATGTTTTAGTTTTGTTGTAG

TTACACCATTGCTGGCTTTGGATCTGCACTGTGTCCACTCCAGGTGGAAC

CACGCACACAAGCCTCTCTGTCGGGCCTGTCCTGACTTCTCCTTGTCAGG

GCTGGGATCTCCTTCAAATCTGGCGGAAGTGGTTCTCCAAGTCTGGTCCT 

CAMCGTCAGCAGCATCAGCGCCTAGAAGTGTTAGGAATACACATTCCCA 

GGCCCCACCACAGACCTCCTGCCTCAGAAACTCAGGGCGCTGAGGCTCTA

GGGGCTGCTTTAACAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTC 

TGAGAGCTACTGCCCTACAGAAAGTTACTAGTGCCCTAAAGCTGGCGCTG 

GCACTGATGTTACTGCTGCTGTTGGAGTACAACTTCCCTATAGAAAACAA 

CTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGGCCTTGAGAAAGA 

TTTGGGGTCAAGGATCATGAGCGAGAACACCACTTAAGAGGATAGTGAAC 

TAGTCTGCATGTGAGACGCTGAGATCCTATGTCAGGCTGTGATAGGAGGG 

AAACAGAAACCAAAGGAAAGAACAGCTTTAAGAAGCGCTTAAGAGGTACA 

AAGTAAAATGATGGTGCTAGAAAAGTAGCTTCTTAAAAAGAGCATTTTCC

AGTCTCACCCTGGACTAACTGAATGAGAATCTCAGGAGTGTGAGGCCCAG 

GTATCCATGGTCTTAAAATGCCACCCACCAGGTGATTCCCAGTGTGCACC 

AGGGGTGAGAGTCACAGCCTTAGGCCATGCCACTCAAAGGGTGTCTTCAG 

ACCAGCAGCACCCACAGCTCTGGGAGTGCATCAGAAAGACAGAGGCTTGG 

CACCACCCACACt!TACTGAACCATAGTTTGCAGGTGATTTCTTGCACATT 

AAAGTGTGGGAAATGGAAAAGCTTAGAGTTCAGCTAGCTCGGTGACTCTC 

AGTCAACCTGCACCTGCTCCATGAACTCAGACTGCCTGGGATGGGCCCAG

AAAAGCTCCTGAGGAGATTCTGATGTAAGGCAGGGCTGATAACCATGGAT

CTCATCTGACCCCATATCACTGGGGAGTTACTTAGGATCTTGCCTGGGGC

CAGTCATCTCTTCCATAGACACTGAGAGTGTCCACGATGCTTGGGGCACT

ACAGGGTGGGAGGTGGAGGATCACGGGTGAGTCAGATAGGAAGCCTGCTC

CTGGGGAGCTTACAGTGCTATAGGGCAGCAAGCCAAGGATGCCAATACCT

GTGTGCAGGTACCACTGACGAGTGCAGAGCGCTGCAGCACCAGAGAGGAA

GCTACCCTGTGCAGAGGGGGCTGAGGAGGGCTGCAGGGAGATGACAGGAA

AGCCGGTGTTACAGGAGGAGTCCTCCCCACTCTTTGGGCATGAGGAGACC

AGGAGGACATTCTACAGTGAGAAACCCAGGCAGAGGCCATGTGCTTATGG

CATGGGAAAAGAATGACACCTTAGACTTATTCTCTACATTAGAATTGCCT

ACCACAGATACCCATATTATAGCTTCACATAGTGTGGTGGTTACTGTGTT

TTCATATTGTCACATTTGCCATTTTCCAGCCACCCACCCATTCTTGACAG

TCACTGGCCCAGCCTGGGGGCCCCTGTTCTTTATCAAACAAGTGCCTGAG

CTCTTTGCAGAGGTGAGGGTCACCTGTCCAATCAGAGGCCAGGAGGGAAC

GTTCCCTTTTAAGACCCTACTCTAGGCAGGCCTGGCCCAAATGAGTTGCT

AGGAGCCCACGCCCTAAGAACCCTCTGAGCACTGTTGTGGCTGGTCCTGC

TGCTAGAAGTTGTTCCTCCAGGGCCAGGTGCAAGATTTGTGGCTTTTCAA

AGGAGCCACTAAAGCTCCAGCTCAGCCTTGCACGGTGCTGGGCTCCTGGG

GGCTTCCTGCCTCCAACCCTCCCAACTCTTCCATCACCGCTCCCTTAGCC

TGGCCAGTGCAGGGATCTGTTCCACTCTAGGCACTGCTGAGGGAATGATG

CCTCCAGTCAGAGGGTGCAAAAAAGAGAGTTAAGAAAAACAATGATTATA

AAAAGTCCTTTTTATACGCCAGACATTTTCTTTGCTCAGGCTAAGTGCTA

CTTATTTGAGTAAGCATTTTAGTTCTCATAACTCCTCTCTCAAGTAGGTG

CTGCTATTACTTTCATTTCACAGATGAGGACATTGAGGTTTGGAGAGACT

TAGTAACTTGTCCTCTGTCCTACAGCAGAGCTGGGATTTGAATCTATCTG

TCCAAATCTGGAACCCATTTGCTTGCACAGAAAGCTTAATTGCTTGTCCC

AGCAAGATAGAAAGCCTGGGAGTGGAAGAAATATTCAGTGGCTGTGATGT

CTGAGCCCACAGGCAGGGTGGAGAGCTAGGGCTGGGGCCCTTGGACGTGG

GGAAGAAAGGGCTGAGTCTTCCATTTTCAATGTGAAGTGTTGATATCTGG

TGATATTGATCTAGGTCCAAAGGTGAAGAACTTAAACCCGAAGAAATTCA

GCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTC

ATAGCAGTTCCAGATAAAAACTACATACGCCCAGGTGACTCTCAGTTTTG

GCTGTGTTTTCTGCCTCCACCTAGCAGGGGTAAGGCCTCCTGCTAGGTGG

GCTCAACTCCATGCTATACCATGCCCCATCTCCAGCAGGTGGTGGAAGCG

AGGAGGAGAGGCCCCAGGGACTAGGGCATCAGATGAAGGGTCTCTAGCAA

TGACCAGATCTGAAAGTAGTCTTTCTGGAAGGGCTGGAGAAAAAGAAGGA

GGCAGACACTTAGACTGGAAGAAGAGGAGGCTTAAACCGGTGTGATGGAG
GGAGAAGTGGACCACAGAGTCAAGGGAGAGGGACTGTGCATCAGGCCTGA
AACCCCAGCAGACAGGAGAGACCTTTCCCTGCTCTCAGAACCCACACATG 
TTCTGACTGTCTTTTTCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGA 
GCTCAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAA 
GGGGAGTTTTGTCTCTACTGTGACAAGGATAAAGGACAAAGTCATCCATC 
CCTTCAGCTGAAGGTGAGAGTTCTAGCTCAGTTTCCTGGGCCTTTGGCTA 
CCCCAAAGTAAAAGGCCAAGATCCTCAATGCCTCTCGCTTTCCTGCAAAT 
TCTTATCTTGGCCAATATAACAGGGACATCCACCTTTCTGGAAGCACCAG 
GCAGAAGAGCCCCATAACTTCTTCTCTGGTTCCTTGCCCCTTCTAGGGAA 
GGAGGAGAGACTCCTCACAGCGGGGAGACAGCAAGGAGCTGAGCACCTGT 
TCTCCTCTCCTGGGCTCACTGGTCCTGGCCCTGGGCGGGTGGCGGTCCCC 
TCCTGCTGTGGCCCTCCATGTGGCAAGCAACACAATTGGGCCAGGACCCT 
GGCGTGCTGCTGTAGGGTAGGAGGGTGTGAGGGAGCACTCGGAGGGCAGT 
GTGTCTGCCCTGCAAATTTAGTCCTGGATGGAGCATCCTTTCACTTGAGG 
GGAGAAATCTTAGGAAGCTGAATTAGATACAGATCTAAGCCATATTCTCT 
AATTTTAAAAACTATAGAGCTGAGATTTTGGTATCCATCTGACTCTTACG 
TCTCTCTCTCTCTCTCTCTCTCTCTCAGTTTATTTTTAATCTGGGGGACA 
AGAAGGCCTGGAAAAGAGGGCATGATTGCTTATCATCCCTTAAATACCAG 
TACCAAGGCTGACACGTCATCTTTCCCAAGGACCATCTGCCTTCTCTCTT 
TTCCTCCTCTCCTGTGTAAAGGCCTGGAGGATGAGCACATGTGCTGTGTT 
TTCCTCCCTCTCAAAGCCTGTGCTATCTAATTAATCCCTTTTACCTCACA
GAAGGAGAAACTGATGAAGCTGGCTGCCCAAAAGGAATCAGCACGCCGGC 
CCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGAACATGCTGGAGTCG 
GCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCC
TGTTGGGGTGACAGATAAATTTGAGAACAGGAAACACATTGAATTTTCAT 
TTCAACCAGTTTGCAAAGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAG 
GAAACTGCCCCATTGAACGCCTTCCTCGCTAATTTGAACTAATTGTATAA 
AAACACCAAACCTGCTCACTAAACTTTCTGTCATTGGGTTTCATTTCTCA 
TTCATGCTTTAAGGATTTGTGTTTTTAGGATATAGCAAGAAGCTTGTTTA 
ATTACAAAGTTCTGGGTTGGAAAGAGACCGGCTTCTGCTTGTGTACTGCT 
ACCCTGAACCATCAGACATGCATGTGTGTGTCATATGCTATGATGTGGCC 
AGTCTGAGTGCAATACTTGCAGCGGGAAGGAGCAGCTGGGTGCATGCTGT 
GCTCTAGAATTAGTCTTTCCTACTGGGGTTTGGTAGATTCTGAGGGCATT 
GATCCTGGGGCAGAAGTGGCTGAGTCTGTGTCTAGGGTACAGTGTGCAAG 
AAAGAAATGTAACAGCAAGTCACAATCCAGCCAAGTGATAGTGGAAAAGG 
GGTAGTTAGGTCCCAGATAAGGAGCAGGGTGACTTGACCTGTGGGAAAGG 
CACAGAGACAAGGAATCTGGGTCAGATGACAGCCAGGAGACCAGGTGAGG 
GAGGAGCCAGGTACTGTCTGGGAGGCTTGTCAACAAGGGCATGGTCCTAT
CACTAAGCAGGGCTCAGATCCTCATAATGGGGGAGTGGAAGGCTGGCCGA 
ACAGAAATCAGGGCCTGGAAACAGAGTGAGGGGGTGGAGACAGGAGACTG 
AGGCTTGGAAATTAGTTTATTAGTTTTAGCTCTTCAGTTACAAGCAATAA 
TAATAGCTTCTAGCTTATTTAAGCAACAAGTATACTACAAAAGGAGCTTT 
CTAGAAGGATATTGGGTATATTCATTTCTTACTGCTGCTGTAACAAATTA 
CCACCAACTTAGTGGTTTAAACAATGCAATGTATTATCTTGCAGTTATGG 
AGGTCAGTCTGGAATGTGTCTCACTGGGCCAAAATCAAAGTATCAGCAGG 
ATAGCATTGCTTTGGGAGGCTCTAGGGGAGAGTCAATTTCCTTGCCTTTT 
CCAGCTTCCAGAGGCCACCTGCATTCCTTGGCTAGTGGCCCACTCCCATC 
TTCGCTGCTTGGGTTTTTCTCACACTGCTTTGCTCTGACCCTCCTGCCTT 
CCTCTTTCACATATAAGAACGCTTGCAATTTACATCGGGCTCACGTCAAT 
ATCCAGGATACTCTCCCGTCTCAAAGAGGCTTAACTTTAATCACAGATGC 
AAAGTCCCTTTTGCTATGTCATGTAACATATACACAGGGTCTGGGGATTA 
GAATGTGGACATTTTCGGGGTGCCATTATTCTGCCTATCATGTGAAGTAA 
CTTTCAAAATGGAAAGACATGCTGAAGAAAAAGTCAGGGATTTCTGGCAG 
GCCAGAAATGACAGAAGGCAGAAAACGTTGGTCCCATCACTCAGATGGGT 
AAGAGCCAATCATGCTTTTTGTCAGTTAGCAAAAGATTGAGATTCCAAGC 
AAAGCATGCAACTGCCCTAGTTTGGGTCATGTGTCGACTCCTTGGTCAGT 
GAAGGGCAGCACACCTTGATCAATACTCCCTCCAAGACTGTATCCAACGA 
GGCCAGTGATGTTCCTCAAAGCAGAGCTAGAGAGCTAATCCCAGGAGAGA 
GGCGTGTGGGTGGTGGGCAGGAAGACAAAGCTCAGCCGTAAAGGAGTAGT
AGGGACAGCACCCTAGGCATGGAGGCTCAAGTGAGATGATACCCATGGGA 
AAAGCTCTGATAAGGTC^JCTCCTTCTGTTTCTGATCCTGATGGTGATGG

TGATCAACACCAGCCCAGTGACAAAAAAGTACATAGTATATTTAGTAGAT

GTTTCCCACACAGAGAAATGGTAAATATTCAAGGCGAGGAATACTCCAAA 

CATCCTACCTTGATCATTACACATTCCGTGCATGTAATGAGTACTTGCAT 

GTATGCCATAAATATGTGAAATATTATGTATCACTATATAAAAGAAAAAA 

AAATGTGGCCAGGTGACATCCATATTTTGGAGAGGAAGGCATGTCTTCTT 

CATAATATCACAAAACTATTTTCACAACAAAGACACAGCTGTTCAAATTA 

GTCTCTGAGCCGGGGCTGTCTCATGGCAGTGAGGACTCTGGTTCCCTTAC 

AGACTAGCAGAAAGGAGATGGGGCTTACTGACCATGGCCTTGAGGAGGCT 

GAACATGCAGGCCAAATGGAGACACAGACAGCCTGGGCTTGGTCCTGCTC 

CATCCCCTTCCAACCTGATGAGATATAGTGAGTCACTATGACGTGGGTCA 

CTCATGCTTCCTGTGAGGCTCCACCAAGACAGCAAGTGCATCAACACCTT 

ACGGAAGCACAAGGCCCTGTTTGTTGTTGACTTCATGAAAGGCATGGTTG 

TGGTGATCGCATTGAGTAGGCTTTTGGGTGAGAGGTGAAAAACCCCAACT 

ATCATGCATTGCAGCCCTCTGGTGGAAACTGTGCTTCAGGCTCTAAATTT 

CAGGCTCTAGACTGACTCCAGGATGAGTATTTGGAAGCTGAAGTCAATCT 

GTGGTCTCTTCTCCTGTAGAGCAGGAGTCAGCACTTTTCATAGAGTGCCA 

GATTCTATATATCCTGCCACATGCTCTGTTGTTACAGAACAAAGAAGGCC 

ATAGACAGCATGGCTGTGTTGGCAAATACACAAAACAGGCAATAAGCTGT 

ATTTGGCCTTTAGGCTGCAGTTTGCCAACCCCTGCACTAACACAGAGCTT 

AAAGGTGGTGGTGGTGTGCTGGAGCTAGCTTATATCAGCTTGCAATAGCC 

AATTGCTAACATCTCTTCCAAACTCTGTGTCTGTGCCTTGATGTTGATAG 

TTTGAAATTGGCTACCCCATTTAATGCTGCAATCTTTTCTCACCCCAGCA

CTACTGACTCCCCTTTGCCCTGTCTTATTTTTCTCACTCTAACATGCTGT 

ATAGTTTTCTTCTTACATTTATTGTTTGTGTCTTCCACTAGCATGTATGT 

CCCACAAGTTCTTTGCTCTGTGATGTATCCCAAGAACCCACTGCAGTGCT 

TGGCACTTGTAGGAACTCCATAAGATTTTTATAAATGAAGAAAGGAAGAA 

AAAAGAGAGGGAGGGAAAAAGGAAAGGAAGCCTTCTATTTAAATGATGGC 

CTTCTCCATATTTCTATAGTAATATGACTTCCCTTGCAAAGGGGGATGCA 

TTTTGGAAAATGTGTATAAATAAACTCAGGTGGTTTTGAATTTCATTTTC 

CTAACTGTAATTGTAATCATTGGTCTTTATGTTTAGTGAAAAAGTTTTGG 

CCCTTATGCCTCACACCTGAGAATCCCAAAGTATTGGTTTGTTAGAGCTC 

CCATAGAGAACCATAAACTGGGTGGCTTAAAACAACAGAAATGTATCGTC 

TCCTGGTTCAGGAGGCCAAAGTCTGAACTCCAGGTGTTGGTTCATTCTGA 

GAGCTCTGAGAGAGAATCTGTTCCAGGCTTCCCTTCAGTTTGTGGTAGCT 

CCAGGGTTCCTTGGCTGGTGGCAGCAAAACTCCAGTCTCTGCCCCCATCT 

TCACATGACTGTCTTCTCTCTGTGTTTCTGTGTCCAGATTGTCCTATAAG 

GACAGAGTCATACTGAATTAGGGCTCACTCGAATGACTTCATCTTAAGTT 

3AACTGTATCTGTAAAGACCTTATTTCCAAGTAAGGTCACATTCACAGCT

ACTGGGGGATAGGACCTCAACATATCTTTTTGGGGGACATAATTCAACTC

ATAATACCCAACATGATAACTGTTCATCCCATGAAATTTAATGTCTCTCA

AAAGGTGATCTCAGGGCATTTAATCTGTGACAGAAACTCCCATAGGAAAC

ATTCCAACCAGAAGCTCCTTTCACAGCTGGTCACTCCTCCTACCCCATCC

GAGGTCCTGGGGCAGGGTGAGGCAGGTGGGGACAAGAAGAAGGCTGTCTC

GGGTGTAGAAAGAGAAGACCCTTATTCACCCGGCACTCTGTTCATGAATG

AGCTATCCAGCATAGGATATAATAAATCGCTTTAGGAGTGGTAGACTCCA

AACATTTTTTTGGTCCCAGTTATCCTAATCAATTAAACAAACTCTAGAAC

CCATCTTGAAGTGCAGGCATTGGGACATTATGAAACTTACACAGAATTCA

AAAATTTACAAGGGCTAAATAAAACAGGGTCTGACATCTAATATTTTCTT

CCCACATTCCCATGCACTGTCTGGCTCAACCATCCCCAACCCTCACTCTC

ATCCTGGTGGACACATGCCTAGTGATGTGATCAGCTGGTTCACAGGGGGC

TGGTGATGGTGGATATACAGCTTTTGCCAATTTCCATGGCATAACTACTC

CAAATATGGCCAATTTCAAACTACCAACATGAAGGCACAGACACAGAGTT

TGGAAGAGATGTTAGCAATTGGCTATTGCAAGCTGATATAAGCTAGCTCC

AGCACAGCACCACCGCTACCTTTAAGCTCCTTGTGTTAGTGCAAGGGTTG

GCAAACTGCAGCCTAAAGGCCAAATACAGCTTACTGCCTGTTTTGTGTAT

TTGCCAACACAGCCATGCTGTCTATGGCCTTCTTTGTTCTGTAACAACAG

AGCATGTGGCAGGATATATAGAATCTGGCAGTCTTTAATAAGTGCTGACT

CCTGCTCTACAGGAGAACACAGATTGTCTTCAGCTTCCAAACATTCATCT

CTGAGTCAGTCTAGAGCCTGAAATTTAGACTGAAGCACAGTTTCCACCAG
AGGGCTGCAATGCATGA1AGTTGGGGTTTTCZACCTCTCACCCAAWVGCCT

ACTCAATTTTTTACTGCAAAAACATGTTATCATCATTATTTTTTACTTAG 

CCCACCTTTCCTTGGCAATTTTCCATAGGAAAATGCATTCTAAATTTCAA 

CTAATCAGGGGACTTGGAGCCTCTGGACACCCCCTTGTTCCTTGCCCACA 

GTCCCTTGCAGAAGGTGCCTTATCAGAGCGGCTCCATGCAGGGGCTCAGG 

ACAGGATCAGATGTCAGTTGCACCAAGGGGGCAGGGACAGATCCTCTCTG 

CTSACCATGCAGAAGGGACTGTTCAGTGCACCGTCATGGTCCTGGTGATT 

TCTGGTCCATAAGGGAATTTTCACATGCATCGGGTGATTGTCACATCAGC

ACAACACTGTGAGGAAGGCAGAGTGAGAATTTGTGTGCCCATTTTATAGG 

TGAGAAAACAGATGCAGAGACATTAAGTAACTTCACCACAGTCATGCGGG 

TTTTAAGTGGCAGACTTTCAGGTGTTGTGACTCCTAGTCCAGAGTTCTTT 

GCACTGCCCCTGAGGTGCTAAAACTCTACTGTGCTTTAAGACTCACTTGG 

GGAGCTTCCTAAAAAGAGAGATTGCACAACCTGAGATTCTTGTTTAACTG 

TTTTGGGATGTAGCTCAGGGATCTAGCTGCCTTAAAAAAAAAAACTCCCA 

AGTAATTCTGATGCAAGCGGTTCTTTTTTGTCCACCTTTGAAGAAACACT 

GCCTCCTCCCCATACATTTCATTAGAAAATGGTAACATGTTTTTCAGCCT 

GAGAGCCATTTCTGGGTGACCGGACGTCGGCAGCCCGCTGTACTAGCTTT 

CAGTCTAGGCTTAAACACACATGATAGGAGATGTCCTACTCCAGATGATA 

TGAGTCTGAACCATGGAAAAATTCCATTGTGTGGCACATCTGGTGGGTGT 

GCACTGTCCCCAGCAGTGAGGCACCCAGTQAAGACAGCAGCTGGGAGAGG

CTTAGTTACATGCAGTGGGACAGTGTGGGCTAGACTGCTGAGCCCTCTGC 

AGTTTACTCTGTGTCAGGCAATGAGGGTGAAAGGCTGATCAGACCCACGT 

GCAGACCATACCCTCCAGGGAGACAGATATCAGTCAGGACAACCCCAAGT 

GTAGCT3GAGAAGCAGTGCCCAGGTATGACCGGATGTGTATCCAACCAGG 

AAATCTGCATATAAATATAAGAGGAGAAAATGAACAGATGTTGCTCTTAT 

ATGTAGATATTTATGAAGAGCATATAATTTTGTTTTGTGTGTTTTAAGAA 

GTTTATAAGTATGCCTTAAAAATGTATAGTATATACTGTAGGTATTTTTT 

CCATTAGATATTTTGTTTTTCATACTTATCCACATTGACATTGTAGCAAC 

AGTATAATATAACAACCTCCTCTACAAAAGCAGAAGGAAGTGAAGCTTTG 

GAAGGAAGCACCCAGTGAGCTTGCCCCTTTCAGGTGGGTGCAGTGAGCAG 

GAGTCAGTGAGGTTGAGATCCTTTGAGAGGAGGCAATCATTAACCAGGAA 

ATCTGCACTGCATCCTGGCCACACCTAACCCTTGGACAATGGTGCTTGGA 

GCGCCTTCCAGCTCTTAAGGCTTGCGATTTCTTTCTCTCACTCTTCACCC 

ACGATGATTAAATCTTCTCCTACAGAGTTGGACAATAAAGCCTTGAGTTC 

CTGCCTCCCCTGGTGTGATCACGAGGCATAGACATGGCCAGGAACATGTA 

GGTGTCTTTGAAAGCTGAACAAGTTAGTAAATTTCAAACCTCATTTCACC 

CACCAGTAAAATGGGAATAATAATAAACCTATTTTACATAGGGTTGACAA 

GAGGAGTAAAGAGGGATTCAATGAAAGTTCGTTATTATCATTTGTAGTAG

CAGTGTTGATAATATCAACTGAAAGTTCATTATCATTATTAGTAGCAGTA 

TTGATAACCCTCTTTTCTGTGCCTTCTCACTGGTGGGCCCAGGCCATCAG 

CAATGCCCAGGGTGTCATGGATCTCTGCTGCATCGGGCACCAGCTGTGTC 

AATGGTGAGAACAGTACAAGGGTGGGCAGGGCAAGGCAGGAAGCACCCAG 

GAGCAGCAGCTTCATGGGGTGAAGATGTCAGGAGCTTAGGGACAGTCAGA 

GCGGGTGTGCCTCCTCTTGTGGAGCCTTTCTGCGTGGGTAGGAACTGCTG 

CAGCTGTGGCCATGGATTCACCTGAATATGGGTGGAATTAGGCATTCAGC 

TGGGTTAGCTGTGCCTAGAAGGAGGAACTCTAAACTGAGAACTTGTCCCT 

ATTGCCACCTCTGATAGGCAGATGATCCATCCATCAGTGGCTGAGCTGAG 

GTGTGCATGGGGATGGGTAAGAGCCCACACACAGGGCTGATGACTGAGTC 

TATTTAGAACAATAGATGTAAAATCTGATAATGTAAAATGTGATAGATTA 

TTTTGTCAATTAGAAATGGTACCATATAATTATATATATACATAAACATG 

TATACATATACACACATATACATGTGTGTATAAACACACACAGTATTGTC 

CCCTACTCATTCCATAAACCTGATGCCTTTAGCTGGGATTCCCAGCTTTC 

ACTCTCCTCTCTGTCATCTGCTGTCTATATCCTCCCCATCCTGTAATTCT 

GGCTTATATGCCACTTCCTCCCTAAAGCCCTCCCTCAATCCCTTGCTGGA 

AGTGACATTTTCCTCTTTGAGCTGCCCCTGCTTGTGCTTTGGTGAGGTCA 

GCTGTATTGCAGTACCTTGTATTGTGGTTGTCACATCATCGTATAGAATT 

AATTTCTGACACATTCCGTATTTTTCAAAGGGCCTAGTGTGGGGCTTTAA 

CAGTAACTACGCCACCACGCCCAGTTAATTTTTTGTATTTTTGGTGGAGA 

CAAGGTTTCACCATGTTGGCCGGGCTGGTTTCGAACTCCTGACTTCAGGT 

GATCTGTCTGCCTCAGCCTCCTGGAGTGCTAGGATTGCAGGCATGAGCCA
CTGCACCCAGCCACCTATCAAAATTTTAAGTGCCATTTTTATTTTTTATT

^TTTGTAGAAATGGACAAGCTGATCGCAAAATTCACATGGAATTGCAGGA 

GGTTCCAAATAGCCAAAACAATCTTGAAAAAGAAGAACAAAGTTGGAGGA

-TTACACTTTCCAGTTTCAAGACTTAGCTCTTAGCTACAAAGCTACAGTA 

ATCAGAACACTATGGTCCTGGCATAAGTGATGCTGGACAGGTGAGCCCCA 

AAGTGGGACTTAACCTGTGAAGGTTCTTGGCCTTGCCCAGGAAGGAATTC 

AAGGGCAAGCCAATGGGACAAGAAAACAGCTTTATTGAAGGGGCAGTAT * 

ACAGCTCCAGCCCTGTTACAGCTCCAGCCCTGTTACAACTCTGACTACTC 

CTGCACAGAAGGGCTACCCTGTAGGCAGAGAGTAGCAACTCAGGGCAGTT 

TTGCAGTCATTTATATCCACTTTTAACACATGCAGATTAAGGGACAATTT 

ATGCAGAAATTTCTACGGAATTGGTAATAACTTTTGGGTCATGGAGTCAT 

CATGGAAGGGGGGCGGGGAACTCCCTGGTGTTGCCATGATGACGGTAAAC 

TGATATGGCGAACTGGTGGGTATGTCACATGAAAAGCTCCTTCCACCCCA 

GCCCTGTTTCAATTAGTCCTCGGTTTGGTCCAGTGTCCAAGTCCTGCCTC 

CAGAGTCAAGTCCCACCCCCTACCTCTTAAGGAGAGATGTAAATACATGG 

AATAGAATTGAGAGTCCAGAAATAATCTCATACATCTATGATeAATTGAT 

TTTCAGCAAAGGTGCCAAGACCATTCAATGAGGGAAAGAATCATATTTTT 

TTCAACAAATGGTGCTGGATAACCACATGTGAAAGAATGCAACTGGGCCC 

TTATCTCACACCATATACAGAAATTAACTCAAAATGGCTCAAACACTTAC 

ATGTAAGAGCTAAAACTATAATATTCTTAGAAGAAAACAGGGATATATCT 

TTATGACCTTGGATTTGCTGGCTGATTCTTAAATGACACTGAAAGCACAA 

GCAACAAAAGAAAAAAAAATAGGTAAATTGGACCTCATCAAAATTTAAAA 

CTTTTATGCTGGGTGCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGG 

CAGGAGGATCTCTTGAGCCCAAGAAGCTGAGGCTACAGTGAGCCGAAATT 

GTGCCACTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCGAATA 

AATAAATAAACAAATATATAATTATAGATCTCTGGATCTTGCCTTCGGAG 

ACTGACTCAACTAACTGGTCTGGGTGGGAGCCCAGCCATTTGTATTTTTT 

GAAAACTCTCCAAATGATTTTACTGTGCAGCCAAGGTTGAGAATCACTGT 

ATCATAGGGTTGGACTCCTAACTGGAAACAGTTTGCACCATCAGGTGTCG 

CAGCATTCTGATAATAGTTAAGCTTTCCTCCTAGATTTTCTGATATTAGA 

TGAGTCATGTTTACAAGTTTTTACCAAGAGACAAACTATCTTTCTGCCCT 

TACTTTCTCTCTTATACTATTCTAATCCCAGAACCCTTTGGAACTTCCAC 

TGAGAGATGAATCTAGAAAGTGACTCTCTTGGCTACAACAGAGAGTAATG 

TTGGCCTGTTTGTGCCAGATCCAGTTGGTGCTGGTGGTGGGACAGCACCT 

CCCTGAAATCCCCTCCTCTCCCGTCAGATTCAGTCCCCCATTTGCATCAC 

GTACAATCATCACTATGGGTTTCTATTACCTTGCTAGGGCATTTGGAGGT 

ACCATATATACCAACTATTAGTTTTGAGCCATGGTTCCCAAAGTGTGGAC 

TGTAGGGCACCTCAGCACACTCACGAGGTGTCATGGGATATTTAAATATT 

CTGA^GAAAACACAGTGACATCTGTCAGGCCCGTGAAAACCGTTGGCATT 

AAATTGTCTCAACCCAATTGCTTAAGAAGCAGAACTGGCCAGGCACGGTG 

GCTCACATCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCAGATCAC 

GAGGTCAGGAGTTCGAGACCAGCCTGACCAACATAGTGAAACCCCGTCTC 

TACTAAAAATATAAAAATTAGCCATGCATGGTGGCATGCACCTGTAACCC 

CAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCTGGGAAGCGG 

AGGTTGTAGTGAGCCAAAATCGTGCCACTGCACTCCAGCTTGGGTGATAG 

TGAGACTACATCTC^AAAAAAAAAAAATGAGAGAGAGAGAGAGAAGCAGA 

ACCATCAGGTGTTTCTTTTGGCTTAAAGTACTCTGTGAAGAAATTCCTGG 

GACACGAAGGATACCATGAACTGAGAGATTTTGGGAACCTCTGCTTTAGA 

AGCTGGAGGTAGCATTCCTTGGGCACAGTACTGCCTTGGGATCAGCAAAT 

CCTTTTGATGGTGCATTTAGGTGTGGCAAGACAGCTCTTAGAGTGGGACC 

GGGATGTGCTTGGAGACAGAGGGAACTAGATTGAGCTGCCCGATAAAGAC 

ATGCCAGCCTGGCAGAGTGTAGTGACTCATGTCTGTAATCCTAGTGCTTT 

GGGAGGCTGAAGTGGGAGGATTGCTTGAGGCCAGGGGTTTGAGATCAGCC 

TGGGAAACAACAAGACCTCTACAAAAAAAAAAGAAAAAAAAAATTAACCA 

CATGTGGTGGCATGCACCTGTAGTCCCAGCTACCTGGCAGGCTGAGGTAG 

GAGGATCACTTGAGCCCAGGAAGGTAAGGATACATTGAGCCATGACTGTG 

CCACTGCACTCTAGCCTGGGTGACAGAAAGAGACTCTGTCTCAGAAATAA 

ATTAAATAAATAAATAAATATATAGTGGCCATGACATCCCTAGAAAGACA 

AGGTCCTGGGAATAGGTAGAAGCCAAGGGAAATGAGAAATGAGAGGGGGC 

CCTGGAGCTGGAACTGGGGGAGCAGGATGGCCTCTGAGAAGTTCCTGATA 
GTGGTGTCACTGATGTGTCTGATGTTTAGTTGTAATTATTTGCTGGGCCC

CTGTCATCCCTCATATCTGATAGCTCTTTGCTAGTCAAAGTGTGGTCTGG 

GGATCAGCGGCATCAGCATCACTTGAGAACTTGTTAGAGATGCAGAATCT 

AGAGCCCCACCCGGGACCCAGAAACAGAGCCTGCATTTTAACAAGCTCCC 

CAGGTGATTCTCACACACACTCGCATTTGAGAAGCACTGGGCTAGTTGAC 

AGATTCTCAGGCATGGCTGACATTGAAATATCCAGGGAGCAGGCTTGGCA 

TTAGGATGTTTAAAAGTCCTCCAGGTGTTTCTAAAGCCAGGTTTGAGGAA

TTACTGGGCTGATACAAATGTTTTGTGATGATGCTTTGTGTGTGTGTGTG 

TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTAGGGAATTC

TGGGTCACTTGGCACCAACACAGGAAACAATGGAAATATGTGAGCCATGA 

CAGAAAGGTCAGGAGATAAAAGAAATTAGTGACATGAGAGGTACTCCTCA 

GGTGTTAGGAAAGAGGGTAGAGCAAACCAGGTTTTCCACCATATGTTGGA 

TAGGGGGTCAAGTAAATTTCTACTTAAAAATTACAAACAGGGGCTGGGCG 

CGGTGGCTCATGCCTGTAATCCCGCACTTTGGGAGGCTGAGGAGGGCGGA 

TCACAAGGTCAAGAGATTGAGACCATCCTGGCCAACACGGTGAAACCGTG 

TCTCCACTAAAAATACAAAAATTAGCTGGGCATGGTGGTGCGTGCCTTTA 

TTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAG 

GTGGAGGTTGCAGTGGGCCGAGATCGCACCACTGCAATCCAGAGCGAGAC 

TGTGTCAAAAAAAAAAAAAAAAAGAAAATTCCAAACAGGATGACCCTAAG 

CCTGCAGGACTTGGAGACATCTAGGTGACTGATACTCAGTCACAAAACAT 

AATTGGTCACAGGCCTGATGAAATGCACAGCAGACCTTCAGATGGTATGC 

ACTCAAGTGATATCCACAAGTCCACCTAAAGAAATGCTATATTCAGACAT 

TTGGCATCAATCTCTATCAAACAAAGATAGTCCAAAGCAATGGGTTCCAA 

AAACACTTTCCTAAGACAAATTCTCTATTTGCTTTTAATATCAGTCATCC 

CAGCCCTTGGAATAGAGGAGCAAATGATACCAGTGGTACCCTACCACAAT 

GCACCAAGGTATTATACTCTCATGCTCCATTTTCTCCCTCTGTCTACATC 

ACTAATAACTCATTGATTTCTGGTGCAAGCCCTGCTGGGAGAAAAAGTCT 

ACTCTTGTACCTTGGAGCAAGTTGCTCAGAGTAGGTATCGAGGATAAAAT 

TTGGAAAGTTAGAAAAGCTATTAGAAGGAGATCCTAGTAGTTGAAAACAC 

AGCCTGGCCAAGTCAATGATGCTATTTCATCTCCCCAGCCTTGCATGTCC 

ATAGCTAAGGAAGACAATTTAGGCTTGGGCTAGAGGATGGGAAAGGGCAA 

AATTACTGATGCCACAGCCCAGAGAGGTATTCTAGTAATCTGAGGGTGAG 

GACCACATACCTGGTTCAGGGACGTACAGTGTTGACAGCTGTGAGTGGAT 

GCCTGGAGTTCTGGCGTGTCTTCTAGCACAATGATACCTGAGACTCTTGC 

ATCATTGGGAATAATAAAATGGGAGTGGATAGATATGAAATTATGATGGC 

AATAAGCAATCAGCTAATAGCTTCATTGATGGGACAGATTAAAGATGGCT 

GCAAATCCTTTGGTCCAGGTTTGGGATATAGGCAGCATTTGTATTGGAAT 

GCTGATAGTCTGAGGCCATGAAAAGTCCACCTGCAGTAGTGGTAGGAGGA 

ACAAGCCTCACTTTCTTCAATGTGTGTGACTGCTGTCTTGATTCCCTGGG 

TGGCCAGTTCCATTCGTGTGGTTCTTTGGTCCACTTGACTCTGGGGTGGC 

TCTGTGATGGCTTGACCAATACAATGTAGTGGAAATGATGCTGTCATCAT 

TTCCAGCCTCTTCCAGCCTTAAGGAACTGGCAACTTTTATTTCTGTCCCT 

TGGAATACTTGTTCTTGCAACCCATCCATCATACAGTGAGAAATTCTAAG 

CTGCCCCATTAAGAGGCCCACATGGTGATAAATTGGGGTCTTACATACAG 

CCCTAGCTGTGCTCCTAGCTGACAAACAGTAGCAACTTGTCACCAGGCGA 

GTGAACCACTTAGGACTGTATACTCCAGCCCCAGTTGAGCAATGTGGAAC 

AGAGTAAACCATCTCAGCTTAGCCCTGCCCAAACTGCAGAATTATGAGCA 

AAATAATCCCCTAGGCTTTGGGCTGATTTGTTCCAGATTACTGGAACAGA 

ATTTGGTACCAGGGGTGAGGTGCTACAGCAATGAAAGCTTAAGACACGTG 

ACTTTGGTTTTGGGTCTGAGTGGCAGGGGAACTTGGCAGGCCTCAAGGAA 

ACTTTTAGGGAGGGTTGAAGCATAGTGAGGAAAACAGTAGGGGAAGCTAG 

AGGAAAAAATGATGCTTGGTATGTAGTGGTGGGAAGTTTAGCAAAACTCG 

CCTGATGTAATGTGGGAAATTGTAAGAACTCAGAACGATTTAAGGGCATG 

TTTTATAGGTCCTTTAAGAAACTTCTAGGCCAGGCGCAGTGGCTCATGTC 

TGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGCGGATCACAAGGTCAGG 

AGATCGAGACAATCCTGGCTAACATTGTGAAACCCCGTCTCTACTAAAAC 

TACAAAAAAAAATTAGCCGGGCATGGTGGCGGGTGCCTGTAGTCCCAGCT 

ACTAGGGAGGCTGAGGCAGAAGAATGGCGTGAACCTGGGATGTGGATCTT 

GAAGTGAGCCCAGATTGTGCCACTGCACTCCAGCCTGGGCAACAGAGTGA 

GACTCCGTCTCAAACCGAAAAAAAAAAAAAAAAAAAGAAACTTCTAGGGC 
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TGGTCCCGTGGAAGCCTCACACATGGTACACAAAGGCTGTCTTGAAAAGA 
AACGTAAGTGTGTTTTTTGGTTTAATAAAATTGATTATAAATGGATAATG 
CAAAACATTTTAAAGAATTTTACTAGCTTACATTAGCAGATTTGGATCCA 
GTGATTGTTACATTCTGGTACTGAGCCCCTGAATTACTTCTTTGAGTAAG 
GCATTATACCAAAGCTATTGATAGTTGGGCTTATAGGGTGTATGTTTGAA 
GAACTACTAATGTCAAAACCAATATTTCACGGTCGACAAGAGGACATCAG 
AACTGGTAATCCTTATTACCATGACTGGCTGGACAGAATACTCAATGTAA 
TfeGGATTTCCTGCAAATAAAGACGGGGAAGATGTAAAAAAGATGCCTGAA 
CATTCAACATTAATGAAAGATTTCAGAAGAAATATGTATACTAACTGCAG 
CCTTATCAAGTATATGGAAAAACACAAAGTTAAACCAGATAGTAAAGCAT 
TCCACTTGCTTCAGAAGTTTCTTACTATGGACCCAATAAAGTGAATTACC 
TGAGAACGGGGTCCCTGTTTCTTCGAAGACCCACTTCCTACATCAGACGT 
TTTCAACAGTTGTCAAATCCCCTACCCAAAATGAGAATTTTTAACAGAAG 
AAGAACCTGATGACAAAGGAGCCAAAAAGAACCACCACCGGCAGCAGGGC 
CATAACCACACGAATGGAACTGGCCACCCAGGAATCAAGACAACGGTCAC 
ACACAGGGACCCCCGTTGAAGAAAGTGAGGCTTGTTCCTCCTACCACTAC 
CTCAGGTGGACTTTTCACGGCCTCAGACTATCCGCGTTCCAATCCACATG 
CTGCCTATATCCCAACCCTGGACCAAGCACATCCCAGCCGAAGAGCAGTG 
TAGGATACTCAGCTACCTCCCAGCAGGCTCCACAGGACCCACGTCAGACA 
CACGGGTACTGAGCTGCATCGGAATCTTGTCCGTGCACTGTTGTGAATGC 
TGCAGGGCTGACTGTGCAGCTCTCCGTGGGAACCTGGTATGGGCCATGAG 
AATGTACTGTACAACCACACCTGCCCAGTAGCCAAGTTCCTTCCACCGCT 
TTTCACAGATCGGGGTAGTGGCTTCCAGTTTGTACCTATTTTGGAGTTAG 
ACCTGAAAAGAAAGCGCTAGCACAGTTTGTGTTGTGGATTTGCTACTTTC 
ATAGTTAACTTGACCTGGCTCAGACTGACCAGTACTTTTTTTTCCGTGAC 
AGTCTATAGCAGTTGAAGCTGAGAATGTGCTAGGGGCAAGCGTTTGTCTT 
CATATGTCATGAATTCCTCCAGTGTAACAACATTATCTGACCAATAGTAC 
ACACACAGACACAAGGTTTAACTGGTACTTGAAAACATACAGTAGGTGTT 
AACTCAGTGAAATAACCAGGACTCAAAGTAAGATTATTTTGGTACACCTT 
TCTTGTTAGTGTCTTATCAGTGAGTTGATTCATTTTCTACATTAATCAGT 
GTTTTCTGACCAAGAATATTGCTTGGATTTTTCTGAAAGTACAAAAAGCC 
ACATAGTTTTTTTCAGAAAGGTTTCAAAACTCCTAAAGATTAATTTCCAA 
GTATAAGTTTGTTTTTATTTTCAATCTATGACTTGACTGGTATTAAAGCT 
GCTATTTGATAGTAATTAGATATATTCTCATTGATATAAACCTGTTTGGT 
TCAGCAAACAAACTAAAATGATTGTCACAGACAATGCTTTATTTTTCCTG 
TTGGTGTTGCTTGTGGGAAAAAGAAAGAGAGATCAGATTGTTACTGTGTC 
TGTGTAGAAAGAAGTAGACATAGGAGACTCCATTTTGTTCTGTACTAAGA 
AAAATTCTTCTGCCTTGAGATGCTGTTAATCTATATAACCTTACCCCCAA 
CCCTGTGCTCTCTGAAACATGTGCTGTGTCCACTCAGGGTTAAATGGATT 
AAGGGCGGTGCAAGATGTGCTTTGTTAAACAGATGCTTGAAGGCAGCATG 
CTCGTAAGAGTCATCACCACTCCCTAATCTCAAGTACCCAGGGACACAAA 
CACTGCTGAAGGCCGCAGGGACCTCTGCCTAGGAAAGCCAGGTATTGTCC 
AAGGTTTCTCCCCATGTGATAGTCTGAAATATGGCCTCGTGGGAGGGGAA 
AGACCTGACCGTCCCCCAGCCCGACACCCGTAAAGGGTCTGTGCTGAGGA 
GGATTAGTATACGAGGAAGGAACGCCTCTTTGCAGTTGAGACAAGAGGAA 
GGCATCTGTCTTCTGCCCGTCCCTGGGCAATGGAATGTCTCGGTATAAAA 
CCCGATTTTATGTTCCATCTACTGAGATAGGGGAAAACCACCTTAGGGCT 
GGAGGTGGGACATGCGGCAGCAATACTGCTCTTTAAGACATTGAGATGTT 
TATGTGTATGCATATCTAAAGCACAGCACTTAATTCTTTACCTTGTCTAT 
GTTGCAGAGACCTTTGTTCACGTGTTTATCTGCTGACCTTCTCTCCACTA 
TTATCCTATGACCCTGCCACATCCCCCTCTCCGAGAAACACCCAAGAATG 
ATCAATAAATACTAAGGGAACTCAGAGGCCGGCGGGATCCTCCATATACT 
GAACGCTTGTCCCCTGGGCCCCCTTATTTCTTTCTCTATACTTGGTCTCT 
GTGTCTTTTTCTTTTCCAAGTCTCTCGTTCCACCTAATGAGAAACACCCA 
CAGGTGTAAAGGGGCAACCCACCCCTTCATTGCTGATTTGTGAGCGTGCT 
TTAAGGTGAAAAAAGCATGAATGTTAACTTCCTTAAAAAGGTACAGCATC 
CAATTCAAATATTTTTGTCCTGATTTTAATGCTAGTTGATGTAGTGCTAT 
TAAAATTTTGTTCAACATGGACACAGAGAGGGGAACAACACATACCAGGG 
CCTGTTGCGGGGTGGGGATGAGGGGAGGGAACTTAGAGGACAGGTGAACA 
GGTGCAGCAGATCACCATGGCCCACATATACCTATTTAACAAACCTGCAC 

FIG. 4 (60 of 61) 

GTTCTGCACACGTATCCCATTTCTTTTTTTTTTTAAGAAATAGAAAAAA^ 
AATAAAATTTTGTTCACTGATTCTTCCATTTTAAAACTTGTTTGCATGTG 
GTTTAGGATGCCCTTACTTCAGCAAAGGAGAAGGAATAGGAGGGCCTTAG 
AATTTTTGAGGGAAAAAAACCCTATAACATACATTGTACTGTATCAAACT 
ATTTTACATGAATGACACAAGTATTCTGAATAAAAAAATAATTGAACATT 
GTTAAGAACAAGGTGTCATGTAATTTATTTTTCATAAATAAAAAAATTAT 
AG^GGCTTAGACTGAAAGGAACAGAGAATTTAAAAAATTAAAAAGAAGCC 
TTAGTATATTTTTGTATATAGTTTCCATGTGCCATATTTGCCATAATTGG 
ATGAGAATTTTTTGACCTCTGGCAGGGTGACCCTATATTTTCANTNTATA 
AAGCGTGCATCATACC 
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